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MARGINAL PERCENTAGES IN MULTIWAY TABLES 
OF QUANTAL DATA WITH DISPROPORTIONATE 
FREQUENCIES 


F. Yates 
Rothamsted Experimental Station, 
Harpenden, Herts., England 


SUMMARY 


When multiway tables of quantal data are analysed by transforming 
the percentages and fitting constants to the transformed variate by 
the method of maximum likelihood, direct inverse transformation of 
the values of the fitted constants gives percentages which may deviate 
widely from those observed. The reasons for this are explained and 
methods are given for obtaining correct percentages. 


INTRODUCTION 


Transformations, such as the logit and log log, are now being used 
increasingly for the anaiysis of multiway tables of quantal data. The 
basic procedures have been described by Jolly [1950], Dyke and Pat- 
terson [1952] and Yates [1955]. Their earlier use for this purpose was 
undoubtedly inhibited by the heavy numerical computation required, 
but the burden of this has been greatly lightened by the introduction 
of electronic computers. The Rothamsted computer, for example, has 
a programme for analyses of this type which will handle tables with 
up to five factors in which the number of cells and marginal means 
does not exceed 256. Constants representing the main classifications 
and any desired sets of sub-classifications can be fitted, and the values 
of the constants and the reduction in variance are determined. 

When an analysis of this kind has been completed, it is frequently 
desirable to present the results in terms of percentages, instead of, 
or as well as, in terms of the transformed variate. If the constants 
obtained in the analysis are retransformed directly to percentages, 
considerable distortions may occur. It is the purpose of this paper to 
explain why such distortions arise and how they may be removed. 


ESTIMATION OF A MEAN PERCENTAGE BY MEANS 
OF A TRANSFORMATION 


It is well known that, if a set of percentages is transformed by a 
non-linear transformation such as the logit transformation, aad the 
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mean of the transformed values is retransformed back to a percentage, 
this percentage will not equal the mean of the orginal percentages. 
The ordinary maximum likelihood procedure of fitting, using pro- 
visional and working values and successive approximation, removes 
this distortion. 

This result follows immediately from the fact that the weighted 
mean of a set of percentages, with weights proportionai to the fre- 
quencies, is the maximum likelihood estimate of the mean percentage. 
A direct proof may, however, be of interest, and iilustrates the pro- 
cedure te be followed in the more complicated case of two-way tabies, 
discussed in the next section. 

Suppose we have a set of proportions p, = z./n, , where n, is the 
number of observations in class r and z, the number of those tuifilling 
the required condition, so that 100 p, is the corresponding percentage. 

If the observations in ll the classes are in fact samples from the 
same population, the sufficient estimate of the popuiation proportion 1s 


= 2,/ >, n, = > np,/ > 


If, with any transtormation y = f(p), with inverse p = F(y), a common 
provisionai value Y is taken, and Y,,,, and Yj, ave the maximum and 
minimum values and w the weighting coefficient for this value of Y 
(as defined in the ordinary maximum likelihood procedure), then the 
working vaiue for class r will be 


The weighted mean of these values, which will provide a first estimate 
of the adjusted mean, and also a new provisional value, will be 


n,w n, max n, mio 


If successive approximations are now carried out until there is no change 
in the provisional value, we shall arrive finally at the equations 


7] Y, 


(1) 


where Y, Y 4. and Y,,;, are the final vaiues of Y’ and the corresponding 
maximum and minimum. If P = F(Y), the formulae for Y,,,,. and 
Yimin ore 
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dY 
dY 
Yin = — Pop 


(See, for example, Fisher and Yates [1957], p. 14.) Hence equation (1) 
reduces to #Q — = 0,i.e. P = and therefore # = F(§). In other 
words the inverse transformation of the maximum likelihood estimate 9 
gives an estimate # which is equal to the weighted mean of the original 
percentages, with weights proportional to the frequencies. 


TWO-WAY TABLES 


In the analysis of a multi-way table of quantal data the expected 
values of the transformed variate are assumed to be made up of additive 
components representing the various classifications which require to 
be taken into account. 

Thus in an R X S two-way table the expected value of the trans- 
formed variate for the cell (r, s) is given by 


= 


where m is a general mean, a, , --- , a, is the set of constants for the 
first classification, and b, , --- , b, the set for the second classification. 
The values of m and the a’s and 6’s are obtained by the maximum 
likelihood fitting. ’ 

If the 9,, are transformed back to percentages 100 #,, , then 100 #,, 
will represent the expected percentages on the assumed hypothesis 
without distortion. Examination of the equations which are used to 
determine the fitted constants, on the lines of the last section, shows 
that in the case of the logit transformation the weighted means of 
~,. , with weights equal to the original frequencies, will reproduce the 
marginal percentages of the original table exactly. This depends on 
the fact that for this transformation 


dY 


oP = const, 


which follows from the general formula for w, namely 


With other transformations there will be some discrepancy. There 
is, however, no real inconsistency; the marginal percentages so obtained, 
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not those observed, are the appropriate estimates of these percentages 
for the adopted model, with the observed weights. 

To represent the estimates of the class-differences in terms of per- 
centages in compact form, we require the expected percentages of the 
margins of the table with some form of proportionate weights. As 
in the quantitative case it is usually best to give the percentages that 
would be expected if the frequencies in the cells of the table were 
proportionate to the marginal frequencies, i.e. 


ni, =n, n,,/n.. , 


where 
s 


These percentages can be computed directly from the percentages 
100 #,, with the appropriate weights, e.g. 


A 


p.n.,/n,, 


We will refer to this procedure as computation from the expected cell 
percentages. 

Computation of the expected marginal percentages from the expected 
cell percentages, if done by hand, is somewhat tedious, particularly in 
large multiway tables, since the expected percentages of the individual 
cells have first to be computed and their weighted means then taken. 
The question therefore arises whether the values of the fitted constants, 
which are given directly by the computation, can be used to provide 
values of expected marginal percentages. 

In the analogous analysis of quantitative data the values of the 
fitted constants give direct estimates of the values the marginal means 
would assume if the cell frequencies were proportionate to the marginal 
frequencies. With exactly proportionate cell frequencies the values of 
fitted constants m + a, and m + b, are given directly by the two sets 
of marginal means. In the case of quantal data, however, retrans- 
formation of the values of the fitted constants m + a, and m + 5, 
will give percentages which may differ widely from those obtained by 
computation from the expected cell percentages. 

These differences are due to two causes. In the first place, since 
the fitted constants are in effect weighted means of the expected cell 
values of the transformed variates, the corresponding percentages will 
differ from the means, with the same weights, of the cell percentages. 
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In the second place the weights used in the fitting are the products 
of the cell frequencies and the weighting coefficients. The resultant 
fitted constants are therefore estimates of what the marginal means of 
the transformed variate would be if the weights were proportionate to 
the marginal sums of these weights instead of to the marginal fre- 
quencies. If the angular transformation, which has a constant weighting 
coefficient, is used, no differences will arise from this cause, but with 
other transformations it is frequently the major source of discrepancy. 
~ At first sight it might appear that the best course would be to seek 
a value m’ such that the quantities m’ + a, when transformed give 
percentages whose weighted mean, using the marginal frequencies as 
weights, is equal to the observed overall percentage. This can be done 
without difficulty by successive approximation, and appears to be 
reasonably satisfactory with the angular transformation. When the 
percentages are small and the Jogit transformation is used, a much 
closer approximation to the marginal percentages computed from the 
expected cell percentages is obtained by the very simple procedure of 
transforming m + a, to percentages and multiplying all these per- 
centages by a common factor, chosen so as to give the observed overall 
percentage, as is illustrated in the example below. This is to be expected 
since at the lower end of the percentage scale the use of the logit trans- 
formation implies that the cell percentages are representable by a 
product function of the form u, », . 

It may be noted here that even with the logit ‘transformation the 
overall percentage computed from the expected cell percentages will 
differ from the observed overall percentage, owing to the change from 
actual to proportionate frequencies in the cells. The difference is not 
likely to be large, but for practical purposes it may be regarded as 
preferable to present marginal percentages whose weighted means are 
exactly equal to the observed overall percentage. For this purpose 
a proportional adjustment of all percentages (or their differences from 
100 when all percentages are large) will be adequate. 


EXAMPLE 


Table 1 gives the percentages of cows affected by milk fever in given 
lactations, classified by lactation number and season of calving, and 
Table 2 the numbers of calvings on which these percentages are based. 
These results were obtained in the course of a survey of diseases of 
dairy cattle. 

It is immediately apparent from the percentages that there is a 
much greater incidence of milk fever in the later calvings, and also 
a variation with season, January-April having the lowest incidence and 
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TABLE 1 
PERCENTAGES OF COWS AFFECTED BY MILK FEVER 
Lactation 
1-2 3 4 5 6+ ‘| Overall 

Jan-Apr 0.42 | 1.45 | 3.15 | 5.57 | 6.07 2.67 
May-July 0.17 | 3.08 | 7.37 | 9.43 | 10.94 4.17 
Aug-Sept 0.66 | 4.89 | 9.40 | 10.93 | 13.98 4.25 
Oct-Dec 0.35 | 2.93 | 6.07 | 9.61 | 12.19 3.50 

Overall 0.41 | 2.75 | 5.58 | 8.06 | 9.38 3.46 


August-September the highest. The August-September maximum is 
somewhat masked in the marginal percentages, however, owing to the 
greater proportions of these calvings in the earlier lactations. 

Constants were fitted using the logit transformation with logs to 
base 2. The results shown in Table 3 were obtained. Inverse trans- 
formation of these logits gives percentages for seasons which are very 
much too high. This is mainly due to the low weighting coefficients 
associated with the low percentages of lactations 1 — 2. 

The expected logits for the individual cells can be computed from 
the values of Table 3, and transformed to percentages. Thus the logit 
for the top left-hand cell is —4.0121 — 2.3786 + 1.9874 = —4.4033, 
and the corresponding expected percentage is 0.2229. The discrep- 
ancies between the weighted marginal means of these percentages, 
with weights equal to the original cell frequencies (Table 2), and the 
observed marginal percentages, are all less than 0.001. 


TABLE 2 
NuMBERS OF CALVINGS 
Lactation 

* 1-2 3 4 5 6+ Total 

Jan-Apr 3806 2137 1744 1184 2010 10881 
May-July 2352 1006 706 488 841 5393 
Aug-Sept 3169 | 1003 617 366 522 5677 
Oct-Dec 5117 1740 1252 791 1042 9942 
Total 14444 5886 4319 2829 4415 31893 
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Frrrep Constants (Loaits To BasE 2) AND CORRESPONDING PERCENTAGES 


TABLE 3 


Lactation m + ar % Season m+b, % 
1-2 —4.0121 0.383 Jan— | —2.3786 3.566 
3 —2.5643 2.779 May- —1.9180 6.544 
4 —2.0055 5.840 Aug- — 1.6682 9.009 
5 —1.7082 8.564 Oct- —1.9104 6.609 
6+ —1.5678 10.220 


Table 4 gives the various estimates of the expected marginal per- 
centages with proportionate weights. Line (1) gives the weighted means- 
of the expected cell percentages, with weights equal to the marginal 
frequencies of Table 2. These give a small discrepancy in the overall 
percentage, and this is removed in line (2) by proportional adjustments. 
Line (3) is obtained by proportional adjustment of the percentages 
derived directly from the fitted constants (Table 3). 


TABLE 4 


MARGINAL PERCENTAGES COMPUTED BY VARIOUS METHODS 


(1) From expected cell percentages, with proportionate weights 


(2) Line (1) adjusted to observed overall percentage 
(3) From fitted constants (adjusted) 


The agreement 


Lactation 

1-2 3 4 5 6+ Overall 
(1) 0.39 2.79 5.85 8.55 10.18 3.65 
(2) 0.37 2.65 5.55 8.11 9.66 3.46 
(3) 0.36 2.63 5.54 8.12 9.69 3.46 

Season 

Jan May Aug Oct Overall 
(1) 2.19 3.99 5.46 4.03 3.65 
(2) 2.08 3.78 5.18 3.82 3.46 

2.06 3.46 
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between lines (2) and (3) is very close. With the logit transformation 
and small percentages direct derivation from the fitted constants may 
therefore be regarded as quite satisfactory. 

Comparison of these expected percentages with the observed per- 
centages of Table 1 shows that the distortions in the observed marginal 
percentages due to disproportionate weights have been removed by 
the fitting. 

The residual sum of squares given by the fitting is 17.57. This 
corresponds to a x” with 12 d.f., giving .2< P < .1. The data are 
therefore reasonably represented by additive components in the logit 
scale. 


MULTI-WAY TABLES 


If only constants representing the main classifications are fitted 
the procedure of proportional marginal adjustment is exactly the same 
for three- or more-way tables as for two-way tables. If, however, 
constants representing certain sets of sub-classes (analogous to the 
interactions of a factorial experiment) are also fitted, the situation 
becomes more complex. Nevertheless if the expected values of the 
transformed variate are available only for the margins of the table, a 
procedure analogous to that given above for a two-way table will 
probably be quite satisfactory, but certain minor inconsistencies in the 
margins relating to the main classifications may be expected. 


PROGRAMMING POINTS IN ELECTRONIC COMPUTATION 


Any programme for the analysis of multiway tables of quanta! data 
which follows the ordinary maximum likelihood procedure of successive 
approximation must embody provisions for computing the expected 
values, in terms of the transformed variate, of the individual cells of 
the table, since these values constitute the provisional values for the 
next approximation. It must also contain provision for forming the 
weighted marginal means, with disproportionate and probably also 
with proportionate weights. Consequently, in order to calculate the 
expected cell percentages all that is necessary, in the way of additions 
to the programme, is to re-calculate the expected values of the individual 
cells after the last approximation (this will probably be done in any 
case), transform these to percentages, and calculate the marginal means 
of these percentages with weights proportional to the marginal fre- 
quencies of the main classifications. If desired a proportional adjust- 
ment can be made to these percentages so as to give the observed 
overall percentage. 

This procedure is likely to be just as simple, from the programming 
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point of view, as the alternative procedure of adjusting the marginal 
percentages proportionally and should therefore be adopted when writing 
a complete programme. The latter procedure is, however, of interest, 
since it makes it possible to deal expeditiously with results furnished 
by a programme for which no provision for inverse transformation has 
been made, or which only inversely transforms the expected marginal 
values of the transformed variate. 
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SOME CLASSIFICATION PROBLEMS 
WITH MULTIVARIATE QUALITATIVE DATA’ 


G. CocHran 


Department of Statistics, Harvard University, 
Cambridge, Massachusetis, U.S.A. 


AND 


Cart E. Hopkins 
Department of Biostatistics, University of Oregon Medical School, 
Portland, Oregon, U.S.A. 


1. INTRODUCTION 


Since 1935, when Fisher’s discriminant function appeared in the 
literature, methods for classifying specimens into one of a set of uni- 
verses, given a series of measurements made on each specimen, have 
been extensively developed for the case in which the measurements are 
continuous variates. This paper considers some aspects of the classifi- 
cation problem when the data are qualitative, each measurement taking 
only a finite (and usually small) number of distinct values, which we 
shall call states. Our interest in the problem arose from discussions 
about the possible use of discriminant analysis in medical diagnosis. 
Some diagnostic measurements, particularly those from laboratory tests, 
give results of the form: —, + (2 states); or —, doubtful, + (3 states); - 
or (with a liquid), clear, milky, brownish, dark (4 states). . 

With qualitative data of this type an optimum rule for classification 
can be obtained as a particular case of the general rule (Rao, [1952], 
Anderson, [1958]). The rule is exceedingly simple to apply (Section 2). 
In practice, qualititative data are frequently ordered, as with —, 
doubtful, +. The classification rule discussed in this paper takes no 
explicit advantage of the ordering, as might be done, for instance, by 
assigning scores to the different states so as to produce quasi-continuous 
data. The best method of handling ordered qualitative data is a subject 
worth future investigation. 


i1This work was assisted by a contract between the Office of Naval Research, Navy Department, 
and the Department of Statistics, Harvard University by a Special Research Feilowship of the Na- 
tional Institutes of Health, and by a gift from the Lilly Research Laboratories to the Department 
of Biostatistics, University of Oregon Medical School. Reproduction in whole or in part is permitted 
for any purpose of the United States Government, 
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This paper seeks answers to three problems that arise in the use of 
the proposed rule. 


(1) The effect of the initial sample sizes on the performance of the proposed 
classification rule. 


In order to construct a rule, we must have preliminary data on some 
specimens known to be classified correctly, since the rule depends on 
the joint frequency distributions of the measurements within each 
universe. Standard classification theory assumes that these distributions 
are known exactly,.although in practice it is sometimes difficult to 
obtain adequate samples from which to estimate them. The conse- 
quences of constructing a rule from preliminary samples of finite sizes 
are discussed in Sections 3, 4, and 5. 


(2) The relative discriminating power of qualitative and continuous variates. 


-Classification is simpler with qualitative than with continuous 
measurements. For this reason, if a few of the available measurements 
are continuous while the rest are qualitative, we may be inclined to 
transform each continuous measurement into a qualitative one by 
partitioning its frequency distribution, provided that this does not 
result in too much loss of discriminating power. This consideration led 
us to investigate the questions: if a normally distributed variate is 
transformed into a qualitative one with s states, how much discrimi- 
nating power is lost, and what are the best points of partition of the 
curve from the point of view of retaining maximum discriminating 
power (Sections 6 and 7)? 


(3) Use of classification experience for improvement of the rule. 


The optimum rule depends on the relative frequencies 7, , 72 , 73 etc. 
with which specimens from the different univefses present themselves 
for classification. The initial estimates of these frequencies, made from 
previous data or by judgement, may be biased. After a number of 
specimens have been classified by the rule, it is possible to re-estimate 
the frequencies 7, from the data for these specimens, so that the classi- 
fication rule may be improved (Section 8). 


2. AN OPTIMUM RULE FOR CLASSIFICATION 


Ideally, the setting up of an optimum rule demands three different 
kinds of preliminary information. Firstly, we require the joint frequency 
distribution of the measurements in each universe under consideration. 
With continuous variates the most common assumption about these 
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distributions is that they are multivariate normal with the same dis- 
persion matrix in each universe. With qualitative variates, suppose 
that the jth measurement takes s; distinct states. When this measure- 
ment is made on a specimen, we learn into which state it falls. A 
series of qualitative measurements, e.g. one with 2 states, one with 3 
states and one with 4 states, defines 24 cells or multivariate states. 
For any specimen, these three measurements tell us which of the 24 
states the specimen occupies. Thus, in general, a set of k qualitative 
measurements classifies the specimen into one out of a number S = 
88, ++ 8 multivariate states. Consequently the joint frequency dis- 
tribution of the measurements is completely specified for the uth uni- 
verse if we know the probabilities p,, that the specimen falls into the 
ith multivariate state where, for each u, 


8 
Pe = 1. 


t=1 


Secondly, practical use of a classification rule is likely to result in 
some mistakes in classification. In some applications certain types of 
mistakes are more serious than others. If the relative costs of different 
kinds of mistakes can be estimated, the rule should take them into 
account. Let c,, be the cost incurred when a specimen which actually 
belongs to universe V is classified as belonging to universe U. 

Thirdly, when the rule is put into use, the specimens to be classi- 
fied may not come with equal frequency from the universes. For 
instance, in a diagnostic screening test, most of the subjects who present 
themselves may be free from the disease in question. Let x, denote 
the relative frequency with which specimens come from the uth universe, 


where 
m= 1. 


As might be expected, the optimum rule depends on these frequencies. 

With this background we now construct an optimum rule. For a 
specimen which falls into the ith multivariate state, we select the 
universe to which it is to be assigned by minimizing the expected cost 
of mistakes in classification. Suppose that the specimen is assigned 
to the uth universe. For specimens that actually come from this 
universe, no mistake is made. Specimens from the vth universe will 
present themselves with relative frequency x, , and will fall into the 
ith state with relative frequency x,p,, . The expected cost of mis- 
classifying these specimens into the uth universe is therefore z,p,;c,, . 
Hence the total expected cost of mistakes in classification is 
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To apply the optimum rule we compute this quantity for every u 
and assign specimens to the universe for which the quantity is a mini- 
mum. 

There are several particular cases of the rule. If c,, = c, , ie. 
the cost of misclassifying depends only on the universe from which the 
specimen comes and not on that into which it is misclassified, the total 
expected cost over all m universes is 


Since the first term on the right does not depend on u, specimens falling 
into the ith state are assigned to the universe for which the:triple 
product z,p,.¢, is greatest. 

If the relative costs c, are taken as equal, the rule minimizes the 
expected frequency of mistakes in classification. If, further, the relative 
frequencies x, are also equal, any specimen found in the ith state is 
assigned to the universe u for which the conditional probability p,, 
is greatest. 

For a numerical example we are indebted to Dr. Leslie Kish. The 
data come from a large study of voting behavior conducted by the 
Survey Research Center, University of Michigan. (Stokes et. al. [1958]). 
By open-end questionnaires taken during the 1952 and 1956 elections, 
voters were rated as Democrats (D), Independents (J) or Republicans 
(R) and also as Unfavorable (U), Neutral (N) or Strongly Favorable (F) 
to Eisenhower’s personality. These are the two predictor measurements, 
each with 3 states, making 9 bivariate states. The voters were also 
asked whether they voted for Stevenson or Eisenhower, these being the 
two universes. The sample sizes were 1003 and 1439. . 

For illustrative purposes we have set up a rule classifying the 
subjects as Stevenson or Eisenhower voters by means of the predictor 
variates. The results of the classification are then compared with the 
actual voting behavior. 

Table 1 shows the relative frequencies p,; and p.; of the ith state 
for Stevenson and Eisenhower voters respectively. We assumed that 
c, = c, and that x, = x, = 3. The resulting classification rule is given 
on the right in Table 1. Persons falling into the states DU, DN, DF 
and IU are classified by the rule as Stevenson voters, since in these 
states Piz > Pai 

The estimated probabilities of misclassification are easily obtained. 
Any Eisenhower voter is misclassified if he falls into the states DU, DN, 
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DF or IU. Hence the frequency of misclassification for Eisenhower 
voters is 


033 + .079 + .090 + .017 = .219. 


The figure for Stevenson voters is 0.133 and the average is 0.176. 

As with continuous measurements, it is possible to examine whether 
a@ measurement contributes to the classification to a worthwhile extent. 
If political affiliation alone (D, J, R) is recorded, the probabilities in 
the three states are given in the lower part of Table 1, being obtained 
of course by addition from the upper part of the table. The probabilities 
of misclassification are now 0.201 for Stevenson voters and 0.202 for 
Eisenhower voters. Thus the addition of the attitude measurement 
reduces the average number of mistakes by about 13 percent. 

The use of qualitative variates in attempting to date certain of the 


TABLE 1 

ILLUSTRATION OF THE CLASSIFICATION RULE FOR Two MEASUREMENTS 

Resulting 

Pris pra classification 
State (i) Stevenson Eisenhower rule 
DU .347 .033 iS} 
DN .359 .079 Ss 
DF -094 .090 
IU -068 .017 8 
IN .078 .107 E 
IF -026 .154 E 
RU -002 .017 E 
RN .023 .176 E 
RF -004 327 E 
1.001 1.000 


ProsaBiuitres Usine AFFILIATION ALONE 


State Stevenson Eisenhower Classification 
D .800 202. 8 
I .172 278 E 
R .029 520 E 
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works of Plato (i.e. to arrange a number of universes in a time sequence) 
has been discussed by Cox and Brandwood [1959]. 

In the rest of this paper discussion will be confined unless otherwise 
mentioned to the case of two universes, with c, = c; = land x, = x, = }. 


3. EFFECTS OF THE FINITE SIZES OF THE INITIAL SAMPLES 


The initial sample from each universe serves two purposes. It is 
used to set up the classification rule and to estimate the probabilities 
of misclassification for specimens from the two universes, so that we 
can decide whether the classification is accurate enough to be satis- 
factory. 

Some notation will be needed. To avoid double subscripts, let U, 
U’ denote the two universes, and p; , p{ the true probabilities that a 
specimen will fall into the ith multivariate state. Independent samples 
of sizes n, n’ are drawn from the universes. The numbers of specimens 
falling into the ith state are r; , r{ , and the corresponding estimated 
probabilities are = 1r,/n, = 1//n’. Since the true p, , pj are 
unknown, the actual classification rule places a specimen in U if f; > pf, 
in U' if d. < 6!. If d, = 6! , the decision is made by tossing a fair coin. 

For given sample sizes n, n’ the values of #; and #{ will vary from 
sample to sample. Thus the actual classification rule and its per- 
formance.may change from sample to sample. In describing the 
consequences of finite sample sizes, we usually present the average 
results over all initial samples of given sizes n, n’. 

The principal consequences of the finite sample sizes are as follows. 

1. The probability of misclassification as estimated from the samples 
is, of course, subject to a sampling error that in moderately large samples 
may be shown to be approximately of the binomial type. 

2. On the average, taken over all pairs of samples from a given 
pair of universes, the actual probability of misclassification is greater 
than the theoretical optimum: i.e. than the probability that would 
hold if the true p, , pj were used to construct the classification rule. 

3. The average estimated probability of misclassification is less than 
the theoretical optimum probability. From 2, it is also less than the 
average actual probability. 

In other words, finite sample sizes involve two types of penalty. 
Owing to sampling errors in the #,; , #; , the rule obtained may not be 
the theoretical optimum, and the probability of misclassification is 
underestimated from the sample data. 

These results may be illustrated, in exaggerated form, by con- 
sidering samples of size n = 1 from universes having two states with 
the following true probabilities of falling into the states. With the 
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- optimum rule the probability of misclassification is 0.1 for specimens 
. from U, 0.05 for specimens from U’, the average being 0.075. 
Optimum 
State U U’ classification 
1 9 0.05 
2 1 0.95 
With n = 1, only four types of sample result are possible. Table 2 
r PM shows for each type the classification rule that would be set up, the 
of estimated probability of misclassification, and the actual probability 
- that would pertain if that rule were used. A ? means that classification 
‘ve is a toss-up. 
Z To explain the entries in Table 2, consider the last type of result. 
iy The wrong classification is made in both states. Nevertheless, if we 
% TABLE 2 
THE CLASSIFICATION RULE FoR SAMPLES OF 1 
4 Frequency Probability of misclassification 
: Sample result of occurrence Estimated Actual 
4 State | U U’ | Rule 
2i0 110 =0.855 0.0 0.075 
4 | 
i 210 oj? = 0.045 0.5 0.500 
i 
| 
1 ol? (0.1)(0.95) 
149 = 0.095 0.5 0.500 
| | 
(0.10.05) 
2-11 oj UV = 0.005 0.0 0.925 
Average 0.07 0.13875 
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believe the samples, we estimate a zero probability of misclassification. 
The actual probabilities are 0.9 for U and 0.95 for U’, giving an average 
of 0.925. 

The overall average actual probability of misclassification is 0.13875 
as compared with a theoretical optimum of 0.075 and an average 
estimated value of 0.07. 

The fact that the average actual probability exceeds the theoretical 
optimum arises, of course, because when p; > pj , there will be some 
pairs of samples in which #; S p{. Whenever this occurs, the actual 
rule for this state is not as good as the optimum. 

The fact that the average estimated probability lies below the 
theoretical optimum probability may be shown as follows. If p; > pj, 
the contribution of this state to the overall theoretical probability of 
misclassification is #pi . Since r{/n’ is an unbiased estimate of pj , 
the contribution may be written }E(r//n’), where E denotes a mean 
value. The estimated contribution, on the other hand, is } the average 
of the smaller of r;/n, r//n’, which will always be less than 3E(r//n’). 

The bias in the estimated probability and the inferiority of the 
actual to the optimum rule are due primarily to states in which p; 
differs little from pj . These difficulties can be largely avoided by 
withholding a decision, i.e. making no classification, in such states. 
For instance, suppose that preliminary samples of size 64 give the 
following results for a variate with four states. 


No. of specimens 


State U U' 
1 51 1 

2 8 5 

3 3 5 

4 2 53 
64 64 


States 1 and 4 provide good discrimination, but in states 2 and 3 
the estimated probabilities of a correct classification are only about 
0.6. Consider a recommended rule to withhold judgement when a 
specimen occurs in state 2 or 3. From the samples, we have an unbiased 
estimate 21/128, or 16 percent, of the proportion of specimens for which 
the rule makes no classification. This estimate is binomial, with stand- 
ard error ~/(0.16)(0.84)/128. When a decision is made, the probability 
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of misclassification is estimated as 3/107, or about 3 percent. This 
estimate will be almost exactly a binomial variate, with negligible bias. 

If, on the contrary, a decision is made in all states, we would like 
to obtain from the initial samples a reasonably unbiased estimate of 
the probability of misclassification for the actual rule developed from 
the samples. An estimate of the difference between the actual and the 
theoretical optimum probability of misclassification is also of interest, 
particularly when it is not too costly to increase the size of the initial 
samples. These topics are discussed in Sections 4 and 5. 


4. ESTIMATION OF THE THEORETICAL 
OPTIMUM PROBABILITY 


As shown in Section 3, the sample probability of misclassification 
is a negatively biased estimate of the theoretical optimum probability. 
For a given value of (p; + p/), it appears intuitively that the under- 
estimation will be greatest when p; = pi. In this case the bias in 
the ith state is in absolute value, 

ap; — {3H min. , 

where #,; , J; are independent binomial estimates of p; . An algebraic 
expression for this quantity as a function of p,; , can be developed. 
from the binomial distribution, but is unwieldy unless n, n’ are small. 
An approximation to the bias may be obtained by regarding #,; , p/ 
as normally distributed. Since the average value of the smaller of 
two normal deviates is known to be —0.56, the bias in the estimate 
of p; when both samples are of size n is approximately 


—0.56 V p.q;/n. 


With p, = pi = 0.05, n = 50, for instance, this estimate gives — 0.0173 
as against the correct value of —0.0170. 

Since this problem will occur mainly in states with low frequencies 
(unless the ciassifivation rule is poor), a relatively crude sample ad- 
justment for bias shouid be adequate for estimating the overall theo- 
retical optimum probability of misclassification. One suggestion is to 
use (f; + #/)/4 instead of one-half the smaller of #,; , 6; when estimating 
the probability of misclassification, this adjustment being made in 
states in which #; , #{ do not differ significantly at, say, the 20 percent 
level of significance. In the numerical example above, the adjustment 
is made in states 2 and 3. It amounts to estimating the theoretical 
optimum probability as 


1+65+4+4+2 13.5 
128 ~ 128 
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as compared with the unadjusted estimate of 11/128 or 8.6 percent. 

Examination of individual cases indicates that this adjustment 
removes most of the bias in states in which p; = pi. When p; ¥ p/ 
the adjustment tends to over-correct, and on the whole is likely to be 
conservative. 


5. THE AVERAGE INCREASE IN THE ACTUAL PROBABILITY 
OF: MISCLASSIFICATION 


The average increase in the actual over the theoretical probability 
of misclassification may be expressed as a function of the p; , pj and 
the sample sizes n (assumed equal). In the ith state, if p; > p{ , the 
optimum rule gives no misclassification for specimens from U, but 
misclassifies the proportion p/ of specimens from U’ that are in this 
state. The average frequency of misclassification is therefore 3p{ . 
With the samples, there is no increase in this frequency provided that 
r;>r.. Ifr; =1ri,wetossacoin. This gives an increase in frequency of 


i(p; + pi) — = i(p; — pi). 


If r; < rj, we make the wrong decision, with an increase in frequency 
of mistakes of 


3p; — = — Pi). 
Hence, over all states for which p; > p{ , the total expected increase 
in frequency of misclassification may be written 


with an analogous expression for the states in which p; < p{ . 

The probability inside the curly brackets may be obtained from 
tables of the binomial distribution. The normal approximation to 
this quantity is surprisingly good, even for moderate samples and 
small p; , pj. With this approximation, the estimate is the probability 
that a normal deviate is less than 


_vnlp. — pi | 
V Didi + iG: 


No correction for continuity is applied, because of the presence of the 
term $Pr. (r; = rj) in the exact expression. As an example of the 
accuracy, with p; = 0.03, p{ = 0.01, n = 100, the normal approxima- 
tion gives 0.1555 as compared with the exact value 0.1567. 

Use of the normal approximation is illustrated in Table 3. From 
the #; , pf for each state the normal deviate is computed from (5.2) 
and the probability is read from the normal tables. The difference 


(5.2) 
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TABLE 3 


Estimation oF ActuaL Minus Optimum PROBABILITY 
OF MISCLASSIFICATION 


No. of specimens 
State U U' N.D. Probability lpi — Bs | 
i 51 1 Large 0.0 _ 
2 8 5 —0.880 0.189 0.0469 
3 3 5 —0.732 0.232 0.0312 
4 2 53 Large 0.0 _ 
Total 64 64 


between the average actual and the theoretical probability of mis- 
classification is estimated as 


4{(0.189)(0.0469) + (0.232)(0.0312)} = 0.0080 = 0.8%. 


At the end of Section 4 the theoretical probability of misclassification 
was estimated as 10.5 percent. Adding 0.8 percent, we estimate the 
expected actual probability as 11.3 percent. 

‘In order to obtain a general impression of the effect of sample size, 
some study was made of expression (5.1) for the average increase of 
the actual over the theoretical optimum probability, as a function of 
Pp: , p,m, and S, the number of states. The difficulty in studying this 
function lies in the substantial number of parameters involved when 
the states are numerous. A preliminary search was made for a smaller 
number of parameters which might dominate the performance of the 
function. It became evident that the function depends on the overall 
optimum probabilities of misclassification, i.e. on the quantities 

As P, P’ diminish, i.e. as the optimum rule becomes better, the average 
increase also diminishes, i.e. the actual rule is less likely to differ from 
the theoretical optimum. Even with the quantities P, P’, n and S fixed, 
the average increase may vary from almost zero up to a maximum, 
depending on the way in which p, , p{ distribute themselves among the 
states. We decided to tabulate the maximum increase for given P, 
P’, n and S, although aware that this might produce an unduly pessi- 
mistic view of the effect of finite sample size. The worst possible 
distributions of the p,; , p{ were found by trial and error. In most 
situations, though not in all, the worst case occurs when the probability 
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P’ is distributed equally among all but one of the states for which 
p;i > pt. As an example, for 8 states, with P = P’ = 0.1, the worst 
distribution in the first four states is as follows. 


Pi 
.69 

07 .0333 
07 .0333 
07 .0333 


Table 4 shows the maximim average increases forn = 50 
S = 4, 8, 16, 32; P = P’ = 0.05. 0.1, 0.2, 0.3, 0.4. 


TABLE 4 
Maximum AVERAGF INCREASE IN PROBABILITY OF 


n = 50 n = 100 
Opt. Prob. Number of states Number o. states 
Pai 4 8 16 32 4 8 16 3% 
0.05 .009 | .018 | .032 | .059 | .006 | 011 | .020 03-- 
0.1 ; Oil .C22 039 | .067 008 015 , .025 041 
0.2 014 | .027 049 | .082 010 | 20 | .033 .052 
0.3 -116 | .034 057 | .098 011 | .023 | .038 | .062 
0.4 .025 | .037 061 .O71 12 | .026 | .041 .061 
| 


As mentioned previously, the average inc -#se is smallest when 
P, P’ are small. It becomes larger as the numer of states increases, 
being somewhat less than doubled for eacl. dou»ling of the number of 
states. The average increases for n = 100 are _ughly 1/+/2 times 
those for n = 50. Some irregularities in che table will be noticed fo» 
P = 04. The explanation is that the actual rule always gives a‘ 
least 50 percent correct classifications. This imposes an upper limi: 
to the maximum average increase for P = 0.4. 

It is difficult to give general recommendations from this analysis. 
The following comments may be appropriate. 

1. Preliminary sampies of sround 50 specimens from each universe 
should give a good indication of those multivariate states in which 
the classification will be accurate, ana of those in which it will be 
dubious. 
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2. If a classification is to be made for all multivariate states, the 
estimated probabilities of misclassification should be increased by the 
two adjustments for bias. y 

3. If the number of multivariate states does not exceed 8, Table 4 
suggests that samples of size 50 should make the actual rule satis- 
factorily close to the optimum in most situations. With 8 states and 
n = 50, the actual probabilities in the worst cases are 6.8, 12.2, 22.7 
and 33.4 percent, for theoretical optimum probabilities of 5, 10, 20 and 
30 percent, respectively. For larger numbers of states, substantially 
greater samples are indicated. 


6. THE EFFECT OF REPLACING A NORMAL VARIATE 
BY A QUALITATIVE VARIATE 


When some variates are continuous and some qualitative, Rao’s 
general rule for classification still applies. However, it presents difh- 
culties in application, as Linhart [1959] has pointed out, since a dif- 
ferent continuous discriminant is required for each multivariate state 
defined by the qualitative variates. 

In this section the loss of discriminating power when normally 
distributed variates are partitioned into qualitative variates is examined. 
It is assumed that the variates are independent. This assumption is 
unrealistic for applications, but we have not been able to reach general 
results with correlated variates. With independent variates, simple 
asymptotic results can be obtained when the number of variates becomes 
large, and can be checked against the exact results for a small number 
of variates. 

Let the vth continuous variate be denoted by y, , (v = 1, --- , k), 
and the two universes by A and B. The variate y, is normal, with s.d. 
unity in each universe, and with mean 0 in A and 4, in B, the quantity 
6, being “the distance apart” of the two populations. The larger 6, 
is, the better y, is for classification. 

Given k independent variates, it is easy to show that the best 
continuous discriminator, when divided so that its standard deviation 
is 1, is the quantity 


R. Zz 5,y./ ° (6.1) 


For this discriminator the distance between the two populations is 
V> &. As k becomes large with all 5, > 0, the distance becomes. 
so great that classification is almost perfect. In order to discuss the 
situation in which some mistakes in classification are made, we assume 


that as k becomes large the 4, tend to zero in such a way that >> & 
remains finite. 
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The continuous classification rule assigns a specimen to B if R, = 
& . Since R, is normally distributed, the probability of mis- 
classification, which is the same for both universes, is the probability 
that a uormal deviate is greater than 3-/ >. &. 

Let each continuous variate be partitioned into a qualitative variate 
with six states. Results for fewer than six states are derived as particular 
cases of the result for six states. For the vth variate, let Pr. (j, | A), 
Pr. (j, | B) be the probabilities that a specimen from A and B respec- 
tively falls into the j,th state. The 6‘ multivariate states into which 
the specimen can fall may be specified by recording the states j, , 
je , -** , je for the individual variates. Since the variates are inde- 
pendent, the probabilities that a specimen falls into a given multivariate 
state are 


k k 
I] Pr. G, | A) and I] Pr. | B). 
Consequently the optiraum discrete rule places a specimen in A if 


Pr. G14) > 


i.e. if 
> {log Pr. (j, | A) — log Pr. (j, | B)} > 0. (6.2) 
This may be written 
2. > 0 (6.2)’ 
where 


x, = log Pr. (j, | A) — log Pr. Gj, | B). 


The quantity >> z, is a discrete random variable with 6° possible 
values, since its value depends on the multivariate state into which 
the specimen falls. It will be shown that as k becomes large, this 
quantity tends to be normally distributed. 

The way in which an individual variate is partitioned i is shown in 
Figure 1. The six states are labeled, for mnemonic reasons, A, a, a, 
6, b, B: they might be called “strongly, moderately, slightly favorable 
to A’, etc. The partition involves two disposable numbers u, , us , 
which determine the sizes of the regions a, 8 and a, b. (Four numbers 
might be used, but symmetry suggests that two will give the best 
partition.) Strictly, the numbers should be denoted by u,, , Us, , 
since it may be profitable to vary them with v, but for simplicity the 
subscript v will be omitted from u, , u, and 6 at present. 
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Universe A Universe B 


FIGURE 1 
PARTITION OF A. NORMAL VARIATE INTO Srx STaTES 


For an individual variate v Table 5 shows the notation used for the 
probabilities of falling into the respective states. 

The probabilities p,; are integrals over the parts of the normal curve 
indicated in Figure 1. Note that the Pr. (| B) values are the same as 
the Pr. (| A) values in reverse order. 


TABLE 5 } 
Prosasiuities THat A VARIATE WILL FALL INTO THE 6 STATES 
State 
A a a@ B b B 
Pr. (| A) pi D2 Ps pa Ps De 
Pr. (| B) Ps Ps Pa Pa 
We W3 —Ws —We 


The last line of Table 5 shows the six possible values taken by 
the random variable 


x = log Pr. (| A) — log Pr. (| B) 
where 
log + Ws = log (p2/ps), ws = log (p;/ps). 
It follows from Table 5 that 
E(x | A) = w,(pi — po) + w2(p2 — Ds) + ws(ps — Pa) 


while E(z | B) has the same value with sign reversed. The variance 
has the same value for both universes, namely 


V(z) = wilp, + pe) + wilps + Ps) + + ps) — [EC | AD)’. 
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Reverting to inequality (6.2)’, its left side is the sum of a large number 


of independent variables z, and hence tends to become normally dis- 
tributed as k becomes large. 

It remains to obtain the values of E(x | A) and V(x) when 6 tends 
to zero. The leading terms in the expressions for the p,; and the w; 
are found to be as follows. 

Let 


where z(t) is the normal curve, and let 2 , 2; , 2. be the ordinates of 
this curve at 0, u, , uw , respectively. Then, from Figure 1, 


2(t) dt = I, — Ip + — 2%). 
4 
Similarly, 
ps = — 1, + 36% — a), = — 1, — — %), 
ps = I, — I, — 48(z, — 2:), pe = I, — 4622, 


Substituting these values in E(x | A) and V(z), retaining only the terms 
of order 6’, we find 


V(z) = | A) = 2 ,w), 6.4) 


writing f(u, , uw.) for the expression in brackets in (6.3). 

We now reintroduce the subscript v in order to sum over the k 
variates. From inequality (6.2)’ the probability of misclassifying a 
specimen from A is the probability that the random variable ). (z, | A) 
if negative. From (6.3) and (6.4) this variable is approximately normally 
distributed with 


Mean = : Variance = 2 > Uns): 


mews the oubebiien of misclassifying an A is approximately the 
probability that a normal deviate exceeds 


(urs Us,)/2. (6.5) 
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In order to minimize the probability we must maximize (6.5). Since 
{(u:. , Us) does not depend on 6, , expression (6.5) shows that the 
maximum is attained by substituting the same values of u,, , ws, in 
every term, i.e. the values which maximize f(u, , u,). This reduces 
(6.5) to 


Vmax. fu: us) 82/2. (6.6) 


With the untransformed normal variate it was shown earlier that 
the best rule gives a probability of misclassification equal to the proba- 
bility that a normal deviate exceeds 3/ >> 8&2 . Comparing this 
result with (6.6), the relative discriminating power of the best quali- . 
tative to the best continuous rule is 


A , (i , 
2 max. 4) = 2 max. {2 + 4 og (6.7) 

This result holds in the sense that if a randomly chosen fraction 
2 max. f(u, , U2) of the original normal variates is retained, discarding 
the rest, the best continuous rule becomes equivalent to the best quali- 
tative rule based on all the variates. 

The result for five states is obtained by ignoring the distinction 
between the states a and 8, calling the combined state D (doubtful). 
The effect is that the last term on the right of (6.7) disappears. For 
four states we let u, = 0, the states becoming A, a, b, B. For three 
states, the states a and b are combined into a D region. Finally, for 
two states we put u, = 0. 

The form of the expression 2f(u, , u), the values of u, , us giving 
the best partition of the normal curve, and the relative discriminating 
power of the qualitative variates appear in Table 7 for 2, 3, 4, 5 and 
6 states. 

Since the maxima of f(u, , us) are flat, the best values of u, and us 
are given only to one decimal. For two states, the relative power is 
2/x, which will be familiar as the asymptotic power of the sign test. 
Use of five or six states retains over 90 percent of the power. 

The function 2f(u, , u2) has already appeared in two other problems 
involving the replacement of normal curve methods by less efficient 
methods. Ogawa [1951] obtained this function as the efficiency of 
estimation of the population mean from suitably chosen order statistics 
in large samples. D. R. Cox [1957] found the same expression for the 
relative amount of information retained by grouping the normal curve 
when the group boundaries are chosen to minimize the quantity 
E{x — p(z)}*/o’, where u(x) is the mean of the group to which z is 
assigned. So far as we can see, the three problems are mathematically 
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TABLE 7 


Tue Asymprotic RELATIVE. DiscRIMINATING POWER OF QUALITATIVE 
To Continuous NORMAL VARIATES 


Best values 
No. of of Relative 
states 2f(ur , ue) power 
2 — 0.636 
3 223/Is — 06 0.810 
fz (zo — 22)? 
4 + — 1.0 0.882 
| 2 
ze — 22)? 
5 24— + 0.4 1.2 0.920 
I, I; Te 
— 22)? (zo — 24)? 
6 24— + ao 0.7 1.4 0.942 
I, 


different, and the identity of results, as Cox has pointed out, seems to 
depend on particular properties of the normal curve. 


7. COMPARISONS FOR SMALL NUMBERS OF VARIATES 


As a check on the utility of the asymptotic results for practicai 
situations, the best points of partition and the resulting probabilities 
of misclassification were computed directly for smail numbers of vanates. 
For simplicity the distance apart 6 was assumed to be the same for 
all variates. 

Having calculated the probability of misclassification for a given 
5 and number of variates k, we can compute the number k’ of continuous 
normal variates that are needed to give the same probability of mis- 
classification. For these values of 6 and k, the ratio k’/k gives the 
relative discriminating power of the qualitative to the continuous 
normal variates. Unlike the asymptotic case, this ratio is of course 
dependent on the values of 6 and k. For fixed k, however, the ratio 
turns out to be almost constant to two decimal places over the range 
of probabilities of misclassification that are of practical interest. 

In the top half of Table 8, the relative discriminatory power with 
two variates is compared with the asymptotic power from Table 7 
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TABLE 8 
RELATIVE DiscRIMINATING PoWER OF QUALITATIVE 
to Continuous NorMAL VARIATES 
FoR SMALL NUMBERS OF VARIATES 


2 states 3 states 4 states 5 states 
k=2 k=o@ k=2 k=2 k= k=2k=o 
50 64 74 81 -84 88 89 92 
2 states 3 states 
k k 
2 3 4 5 6 7 © 2 3 4 5 © 


for 2, 3, 4 and 5 states. The “small-sample’ powers are all lower 
than the asymptotic values. 

The lower half of Table 8 shows the results with two states and 
from 2 to 7 variates. The results have one curious feature. With an 
odd number of variates the relative powers are higher than the asymp- 
totic values. The explanation may be that none of the decisions re- 
quires the toss of a coin when the number of variates is odd. In line 
with this result is the tact that with two states and only one variate 
(k = 1) the classification rule is the same as that obtained from the 
original norma! variate, so that the relative power is unity. With 3 
states (Table 8) this peculiarity does not appear, the relative powers 
increasing steadily with k towards the asymptotic value. 

The optimum values of u, and u, for small numbers of variates 
were consistently close to the asymptotic optima. This result held 
throughout a wide range of values of 6 ~k from 0.5 (corresponding 
to a probability of misclassification of over 40 percent) to 4. Since, 
moreover, the optima are flat, the asymptotic values of u, and u, 
may be used safely with only two variates. 

A few comparisons have been made between the qualitative and 
continuous discriminators for the case of two correlated variates of 
equal discriminating power. Suppose that the continuous scales are 
arranged so that the variates have standard deviations unity, with 
means zero in population A and 6 in population B. If they are positively 


' correlated, as appears to be the more common situation in applications, 


results obtained for 2 and 3 states indicate that the qualitative dis- 
criminator has higher relative power than when the variates are in- 
dependent. The reverse seems to hold with negative correlation. 
Consequently our results for independent variates cannot be assumed 
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to be valid for correlated variates. More investigation is needed. 

In order to transform a continuous into a qualitative variate in 
an application we compute the means g, , 7, of the continuous variate 
in the initial samples from the two universes and the pooled within- 
universes standard deviation s. Let 9 = (j4 + 9s)/2. With, for 
instance, five states the best asymptotic values of u; , vu. are u, = 0.4, 

= 1.2. The states are constructed as follows. 


Value of y State 


less than 7 — 1.28 

from 7 — 1.2stog — 
from 7 — 0.48 tog + 
from 7 + 0.48 to 7 + 
greater than 7 + 1.28 


Pee 


8. ESTIMATION OF THE RELATIVE FREQUENCY WITH WHICH 
SPECIMENS COME FROM THE TWO UNIVERSES 


If specimens present themselves with unequal frequencies z, 7’ 
from the two universes, the optimum rule is to assign a specimen to 
U if xp; > x'p{. This rule requires an initial estimate or guess about 
the value of x. With qualitative variates the accuracy of this figure 
will often be not critical, because the same optimum rule holds over 
a range of values of z, although the range becomes shorter as the number 
of states increases. Table 9 gives an illustration for 4 states. The 
horizontal line marks the boundary of the best decision region. 


TABLE 9 
Tae Oprmum ror DirFERENT VALUES OF 
Range of x Range of x Range of + 
O11 — .4 .64 .64 — .99 
State pi Pi 
1 -92 .01 92 .01 92 .01 
2 .04 06 .64 06 .04 
3 -04 .07 04 .07 04 .07 
4 -00 .88 00 .88 00 .88 


4 
= 
| 
| Al 
4 
+ 
: | 
oth 
| 
| 
| 
| 
wii 
ve | 
| 
| 
| 
| 
Ai 


30 BIOMETRICS, MARCH 1961 


If we guessed = x’ = 0.5, we would use the middle classification 
rule in Table 9. This remains the best rule so long as x lies between 
0.4 and 0.64. Even with x outside these limits, this rule may be close 
to the best. For instance, if x is 0.7, the average frequency of mis- 
classification using our rule is (.7)(.04) + (.3)(.05), or .043, as against 
(.3)(.12), or .036 with the optimum rule. With x = 0.8 the corre- 
sponding figures are .042 and .024, a more serious difference. 

When m specimens have been classified, the data give the numbers 
of specimens r; found in each state, where ym r; = m. If the p; , pi 
are known, the r; follow a multinomial distribution with probability 


P; = rp; + 
in the ith state. The log likelihood is 


s 

Int — pi) 
L = log (xp. + - (8.1) 

so that the estimated variance of # is 
— 
= Ap, + #’p (8.2) 


The maximum likelihood estimate # obtained ‘i. setting 0L/dx = 0 
in (8.1) can be found by trial, although this is tedious if the states 
are numerous. One way of obtaining a first trial value of is to note 
that each state furnishes an unbiased estimate. Since 


E(r;) = mP; = m(rp; + 2’p'), 
the estimate #; from the ith state is 


t; = m) — (p; = Pp) =P Q./ m(p; pi)’. 


These estimatés are likely to differ markedly in precision. A fairly 
good method for a first trial # is to combine the states for which p; > p , 
making a single estimate from these states. Table 10 shows a numerical 
example for a sample of 100 specimens. 

From the first two states a trial value is computed as 


The values of r,(p; — P; and r,(p, — p/)/P, are next computed 
(Table 10). The total of the latter values gives 0L/dx as —5.2. Since 
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- TABLE 10 
EXAMPLE OF THE M. L. EstrmaTION OF + (COMPUTED FOR # = .78) 


Pi — pi) P; ri(pi — pt)/Ps 
.5 37 14.8 35.9 
| 30 6.0 . 256 23.4 
2 9 —0.9 .122 —7.4 
24 —12.0 .210 —57.1 


100 —5.2 


oL/dm is a decreasing function of « the second trial must be lower. 
The estimate #, = 0.75 gave +0.5 for the first derivative. From (8.2) 
the estimated standard error of #, was found to be +0.075. 

Examination of the formula for the standard error shows that the 
estimate # cannot be expected to be precise: the standard error is 
always larger than the value ~/x(1 — x)/m that would apply to a 
binomial estimate of x. This suggests that it will be worthwhile to 
estimate x only after a substantial number of specimens have been 
classified. 


9. SUMMARY 


This paper deals with the problem of assigning specimens to one 
of two or more universes when the measurements on each specimen 
are qualitative, each taking a small number of states. After presenting 
the optimum rule for classifying the specimens, three problems are 
considered. 

The construction of the rule requires initial data on a number of 
specimens known to be classified correctly. Standard classification 
theory assumes that these initial samples are infinite in size, although 
in practice they may be only moderate. The principal effects of the 
finite sizes of the initial samples are that the probability of misclassifi- 
cation of the rule derived from them is underestimated and that this 
rule may be inferior to the theoretical optimum rule that we could 
construct if we had infinite samples. Methods are proposed for ob- 
taining reasonably unbiased estimates of the performance of rules 
derived from finite samples and for estimating the difference between 
the actual and the theoretical optimum probability of misclassification. 
It appears that initial samples of size 50 from each of two universes 
should be adequate if there are not more than 8 multivariate states. 
With greater numbers of states, larger sample sizes are needed to ensure 
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that the actual rule will be almost as good as the theoretical optimum. 

If most of the variates are qualitative but a few are continuous, 
one possibility is to transform the continuous variates into qualitative 
ones, particularly since classification is easier with qualitative than 
with continuous variates. Asymptotic results are obtained for the best 
points of partition and the probabilities of misclassification when a 
large number of independent normal variates are partitioned to form 
qualitative variates. For qualitative variates with 2, 3, 4, 5 and 6 
states the relative efficiencies are 64, 81, 88, 92 and 94 percent re- 
spectively. Computations for small numbers of variates show that 
the asymptotic points of partition remain satisfactory although the 
relative efficiencies are in general lower. 

The optimum rule depends on the relative frequencies with which 
specimens to be classified present themselves from different universes. 
Initial estimates of these frequencies must be made in order to set 
up the rule. With two universes, maximum likelihood estimates of 
the frequencies from the data for specimens that have been classified 
by the rule are given. These estimates enable the rule to be improved 
if the initial estimates differ from the frequencies that apply when the 
rule is being used. _ 
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THE STANDARDISATION OF TUBERCULIN 
HYPERSENSITIVITY 


M. Stone 
British Medical Research Council Applied Psychology Research Unit 


R. A. Bruce 
- Chest Clinic, Cambridge, England 


INTRODUCTION AND SUMMARY 


Tuberculin hypersensitivity results from stimulation of the reticulo- 
endothelial system by tubercle bacilli and is due to the production of 
sensitised lymphocytes, that is, lymphocytes carrying antibody specific 
towards tuberculin. The presence of this antibody is detected by the 
Mantoux Test in which fluid containing tuberculin is injected into the 
superficial layers of the skin. This intradermal injection forms a bleb 
which quickly spreads outwards and is absorbed, depositing the tuber- 
culin in the superficial layers. After deposition the tuberculin either 
escapes into the circulatory system or is reacted upon in situ by the 
antibody resulting in a final distribution of ‘reaction products” about 
the point of injection. It is postulated that induration or hardening 
of the skin will occur wherever the areal density of these products 
exceeds a certain level and that this level actually obtains at the edge 
of induration. In other words the areai density of tuberculin decreases 
from the point of injection and induration occurs wherever it exceeds 
a certain minimum level, the Minimum Tuberculin Concentration 
(c.f. Miles [1949]). 

In this paper a mathematical model is presented by which the 
antibody-level may be calculated from a knowledge of the dose and 
concentration of tuberculin used and the resultant area of induration, 
provided this is non-zero. A statistical model of the multi-hit type 
deals with the antibody-level induced by different amounts of baciliary 
stimulation and thereby provides a measure of the virulence of the 
particular tubercle bacillus. The models can be applied to the resuits 
of skin-tests for any antibody by means of an appropriaté antigen 
but for convenience they will be explained in the context of data from 
the Mantoux Test. 

R. A. Fisher [1949] made a statistical analysis of the results of 
intradermal injections in cows in order to standardise different tuber- 
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TABLE 1 


Errect oF TUBERCULIN DosE In ConsTANT VOLUME 
ON DIAMETER OF INDURATION 


Diameter mm 
No. of Areal 3 
subjects 10 T.U. Difference 
Group I 

2 11.0 2.0 117 

8 12.0 4.0 128 

ll 12.5 5.0 131 

1 12.5 5.5 126 

10 12.5 6.0 120 

5 13.0 6.5 127 

8 13.0 7.0 120 

2 13.0 7.5 113 

4 13.5 7.0 133 

1 13.5 7.5 126 

ill 13.5 8.0 118 

2 14.0 7.0 147 

2 14.0 8.0 132 

1 14.0 8.5 124 

3 14.0 9.0 115 

1 14.5 8.5 138 

15 14.5 9.0 129 

1 14.5 10.0 110 

4 15.0 9.5 135 

17 15.0 10.0 125 

3 15.5 10.0 140 

1 15.5 10.5 130 

2 15.5 11.0 119 

2 16.0 10.5 146 

2 16.0 11.0 135 

2 16.0 12.0 112 

1 16.5 11.5 140 

12 16.5 12.0 128 

1 17.0 12.0 145 

1 18.0 13.5 142 

10 18.0 14.0 128 

2 18.5 15.0 117 

2 19.5 16.0 124 

4 20.0 16.0 144 

2 20.0 16.5 128 

| 1 20.5 16.5 148 
1 22.0 19.0 123 
é 2 23.0 20.0 129 

> 1 27.0 25.0 104 
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FABLE 1-—(Continued ) 


Diameter mm 

No. of : Areal 
subjects 10 T.U. LEU. Difference 
Group II 

10 12.0 4.5 124 

11 12.5 6.0 120 

5 14.5 9.0 129 

5 15.0 10.0 125 

3 20.0 16.0 144 

6 20.0 16.5 128 

culins. He concluded precision of such (tuberculin) r-uings 


regarded as biological assay seems to have been much underrated.” 
Our work shows that a similar precisicu can be revealed by statistical 
and mathematical analysis for observations on humev subjects and 
guinea-pigs. 


EXPERIMENTAL MATERIALS AND METHODS 


Tests were made on 260 human subjects and 10 guinea-p:gs. ‘The 
tuberculin was PPi) (human) from the Ministry of Agriculture and 
Fisheries Institute, Weybridge. The B. ©. G. was the liquid form from 
the Danish Serum Institute and the virulent tubercle bacillus was 
the strain Myc. Tub. H37Rv from the National Institute of Type 
Cultures. The injection-techniquc was carefully designed to reduce 
the error variance of the volumes injected. The diameters of induration 
were measured to 0.5 mm. at 72 hours after injection. 


RESULTS AND THEORY 


i. The relation between induration-area and dose of tuberculin in constant 
volume 


About 250 subjects were given intradermal injections of 1 and 10 
tuberculin units (T. U.’s) in 0.1 mi. Table 1 gives the results of those 
161 subjects (Group I) who gave readable non-zero reactions to both 
doses. Another 40 subjects (Group II), known to be negative reactors, 
were BCG-vaccinated and then tested with 10 and 100 T. U.’s in 
0.1 ml. Three levels of BCG-dose were used which explains the bunching 
of observations in this group. 

The square of the diameter of induration D will be called the in- 
duration-area, written as D? , Di, , Dioo for the three tuberculin doses. 
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100 200 300 400 500 600 700 
FIGURE 1 
Tue AREAS OF THE PAIRED TUBERCULIN REACTIONS OF Groups I AND II 


The areas are plotted in Figure 1. The areal differences A (either 

2, — D3 or D3. — D3.) are given in the final column of Table 1. There 
is no discernible regularity in the A’s of either group and 95 percent 
confidence limits for the difference of their means are — 1.4, 3.8. Pooled, 
their standard deviation of 7.5 is consistent with constancy and an 
observational error of standard deviation 0.21 mm. For a representative 
areal difference is derived from reactions of sizes D’ and D with means 
15 and 10 mms. respectively. Then 


5A > 30 5D’ — 20 6D 
s.d. A = (900 var 5D’ + 400 var 6D)'” = 7.5. 


These results suggest that the relation between the induration-area 
and the dose d of tuberculin in T. U.’s is 


= a* logd + B 


where a* is a constant for all 201 subjects and £ is a personal constant 
containing all the information in the reaction relevant to the subject’s 
sensitivity. The least-squares estimate of a* is 126.7 (s. d. 0.5). For 
a given subject the critical dose of tuberculin which will just produce 
induration is obtained by putting D = 0 in equation (1). 

Additional confirmation of the equation was provided by testing 
four subjects with a range of doses in 0.1 ml. The results are given 
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TABLI 2 
THE OBSERVED AND PRupICTED INDURATION AREAS AT CONSTANT VOLUME 
Dose of PPD in T.U. (d) 
Subjects 10 5 2.5 1 0.5 0.25 0.1 
1 256 225 196 — 100 -- 6 
(218) (180) (91) (3) 
2 144 110 72 20 0 0 0 
(106) (68) (17) (0) (0) (0) 
3 144 110 72 16 0 0 0 
(106) (68) (17) (0) (0) (0) 
4 110 -- 36 a — 0 0 
(34) (0) (0) 


in Table 2 where the bracketed figures are the areas predicted from 
equation (1) with a* = 126.7, using the observed areas at 10 T. U. 
to estimate 8. 


2. Effect of change of volume of injected fluid on (i) induration-area (ii) 
area covered by fluid. 


16 subjects were each given four simuitaneous intradermal tuber- 
culin injections, 10 and 1 T. U. in 0.1 and 0.2 ml., and two intradermal 
injections of Evans blue in volumes of 0.1 and 0.2 ml. The results 
are given in Table 3 where F is the diameter to which the dyed fluid 
spread after two hours. For the tuberculin results, at each volume the 
areal difference A from 1 to 10 T. U. is calculated for the cases when 
both areas are non-zero. The average of the A’s at 0.1 mi. is 133.2 
which is not significantly different from 126.7. Half the average at 
0.2 ml. is 121.8 which is only 4.9 less than 126.7. The suggestion is 
that the rate of change of area with log-dose is proportional to the 
volume of fluid V. (In fact, for all but one subject the .i at 0.2 ml. 
is less than twice the A at 0.1 ml., so that the relation is probably not 
exact.) Equation (1) can be extended to give 


D* = &V logd + A(V). (2) 


Comparison with equation (1) shows that 6(0.1) = 6 and 0.1 & = a’*, 
so that & can be estimated as 1267. For the dye-injections, the results 
suggest that F’/V is independent of V or that 


F* = «V. (3) 
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Model for the deposition of tuberculin 


By requiring that d exp (— D’/aV log,, e) be constant for constant 
V, equation (2) suggests that the fluid spreads and is absorbed as ?#f, 
before absorption, it assumed the shape of a circular-normal distri- 
bution truncated at diameter F and with a variance s’ proportional to V. 

For such a distribution the areal concentration of fluid f(r) at 
radius r would be 


fr) = [V exp — exp (—F*/88’)] 


forr < 3F. If 5V is the extra fluid conceptually needed to extend the 
circular-normal distribution to infinity then 


V/(V + 8V) = 1 — exp = 


say. If 8s? = aV where a is a constant, then use of equation (3) gives _ 


1 — exp (—«/a) (4) 


so that u is also a constant. If ¢(r) is the areal atattad of deposited 
tuberculin at radius r in T. U.’s/mm’, then 


Ur) = 4d exp (—4r’/aV)/mapV. (5) 


Denoting the Minimum Tuberculin Concentration by 7, this will obtain 
at the edge of induration provided D < F; so substituting r = 34D 
and ¢(4D) = 7 in equation (5), we obtain 


= aV log, (4 (6) 


When D = F, the Minimum Tuberculin Concentration may be exceeded 
at the edge of induration and equation (6) will not be true. Reference 
to Table 3 shows that usually D < F. 


Agreement with the data 


Equation (2) agrees with equation (6) when & = a log, 10 and 
B(V) = &V log (4 log,10/xa@urV). By equation (4) we find that 
the four levels of x represented in Table 3 have corresponding levels 
1 — of 10-**, and 10°°*. So for these subjects 

=> 1 and the relation becomes ; 


D? = aV log (4d log, (7) 


to a good approximation. (It is, however, conceivable that, if more 
subjects had been examined, smaller values of » would have been 
found.) Equation (7) predicts the relationship between D and V when 
d is held constant. Table 4 gives the results of calculations of log + 
for the subjects of Table 3. For each subject, the model is supported 
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by (i) the constancy of the log 7’s calculated from non-zero diameters, 
(ii) the fact that for a zero diameter the log r calculated by putting 
D = 0 in equation (7) is smaller. From equation (7) it follows that, 
when V is increased, D’ will also increase but that a volume is ulti- 
mately reached above which D? will decrease. When d = 10 T. U.’s 
and log r exceeds — 1.238 or when d = 1 T. U. and log 7 exceeds — 2.238, 
we would expect the D? for 0.1 ml. to exceed that for 0.2 ml. The 
expectation is supported by subjects 1, 2, 3 and 4. For subjects 1 
and 2 at 10 T. U.’s and subject 4 at 1 T. U., the effect of increasing 
V is to dilute the tuberculin so much that at no point in the entire 
area of the reaction is the Minimum Tuberculin Concentration reached 
and no induration develops. For subjects 5-16 this dilution effect is 
more than offset by the increased spread of the fluid which results in 
tuberculin being carried further from the site of injection at more than 
the necessary minimum concentration. 


3. Antibody level related to (i) Minimum Tuberculin Concentration and 
(tt) bacillary stimulation. 

(i) Let ¢(T) be the areal density of tuberculin at a particular point 
at time 7 after deposition. The rate of escape of tuberculin into the 
circulatory system without reaction with antibody will be [¢’(7)|z = 
—«t(T) where «¢ is a constant for a given subject (Kety [1949]). If \ 
is the antibody density, the rate of accumulation of reaction products 


will be r’(T) = At(T) and the rate of reaction of tuberculin will be 
[t’(T)]z2 = —or’(T) where w is a constant. Therefore 


#(T) = + = —(e + od)K(T) 
(T) = #0) exp + od)T] 
r(T) = d4(0){1 — exp + wd)T]}/(e + wd) 
= d#(0)/(e + wd). 
If induration occurs wherever r(@) > p then 
r=7/rX+ 6 (8) 
where y = ep and 6 = wp. 
(ii) The multi-hit model. 
Injected (or otherwise acquired) bacilli will be distributed through- 
out the reticulo-endothelial tissues which consist of lymphocyte-producing 


centres. The bacilli stimulate these centres to pour sensitised lym- 
phocytes (antibody) into the circulation. Suppose (a) the bacilli are 
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distributed randomly among a much larger number of centres, (b) at 
least v bacilli are required to sensitise a centre, (c) the resulting 2 is 
proportional to the number of sensitised centres. If B bacilli are 
distributed among C centres and p,(B) and s,(B) are the mean propor- 
tions of centres with r and r or more bacilli respectively, then 


Cs,(B + 1) Cs,(B) = Pr-i(B) = 1, 2, 
or approximately, when B is large, 
Cs?(B) 8,-,(B) s,(B). 


Therefore 
(c44 1) 5B) 1 
the solution of which is 


s,(B) = 1 — + (B/C) + --- + — 


TABLE 4 
Loe Minimum TuBERCULIN CONCENTRATION (7) 
Injection- 
volume 0.1 ml. 0.2 ml. 
Average 
Dose of PPD | 10 T.U. 10 T.U. 1 TU. log 7 
Subject 
1 —0.76 | (—1.64) | (—0.94) | (—1.94) —0.76 3960 
3 —0.83 | (—1.64) | (—0.94) | (—1.94) —0.83 1655 
3 —1.02 | (—1.64) —1.04 | (—1.94) —1.03 1655 
4 —1.87 —1.83 —1.88 (-—1.94) —1.86 1655 
5 —2.41 —2.42 —2.29 —2.4l1 —2.38 3960 
6 —2.41 —2.42 —2.36 —2.41 —2.38 3960 
7 —2.41 —2.42 —2.44 —2.46 —2.43 3960 
8 —2.78 —2.77 —2.76 —2.82 —2.78 3960 
9 —2.78 —2.77 —2.85 —2.82 —2.80 2490 
10 —2.92 —2.77 —2.93 —2.95 —2.89 3960 
il —2.92 —2.77 —3.02 —3.01 —2.93 3960 
12 —3.19 —3.18 —3.21 —3.22 —3.20 3960 
13 —3.48 —3.41 —3.40 —3.36 —3.41 3960 
14 —3.79 —3.78 —3.81 —3.85 —3.81 8408 
15 —4.81 | —4.79 —4.73 —4.81 —4.7 8408 
16 —6.82 —6.57 —6.64 —6.77 —-6.7 8408 
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If B/C is small, we therefore expect \ « B’ or 
= log B+ (9) 


where 7 is a constant. The integer » provides the measure of bacillary 
virulence. 


Agreement with data. 


(i) Human Subjects. 39 female subjects, whose preliminary skin-tests 
showed no induration, were vaccinated with a range of B. C. G. doses 
and after six weeks were tested with 100 T. U.’s in 0.1 ml. The log 7’s 
calculated from equation (7) are given in Table 5. Each subject’s 
B. C. G. dose establishes a level of \. Table 5 shows that the variance 


TABLE 5 
Errsct or B.C.G. Doss on S1zE or INDURATION 
No. of Vol. B.C.G. Induration Average 
subjects in ml. diameter (mms) log + log + 
2 0.300 32.0 —7.72 —7.80 
1 0.300 32.5 —7.97 
2 0.250 30.0 —6.74 —6.90 
1 0.250 31.0 —7.22 
1 0.245 28.0 —5.82 —6.16 
1 0.245 29.5 —6.50 
1 0.155 25.5 —4.77 —4.77 
1 0.150 24.0 —4.18 —4.47 
3 0.150 25.0 —4.57 
1 0.140 24.0 —4.18 —4.18 
1 0.130 23.0 —3.81 —3.81 
2 0.120 22.0 —3.46 
1 0.120 22.5 —3.63 —3.59 
1 0.120 23.0 —3.81 
1 0.100 20.0 —2.79 —2.79 
1 0.080 16.0 —1.66 —1.79 
1 0.080 17.0 —1.92 
1 0.075 14.5 —1.30 
3 0.075 15.0 —1.41 —-1.41 
1 0.075 15.5 —1.53 
2 0.070 14.0 —1.18 —1.30 
1 0.070 15.5 —1.53 
1 0.065 12.0 —0.77 —0.77 
3 0.060 10.0 —0.43 
2 0.060 10.5 —0.51 —0.51 
3 0.060 11.0 —0.59 
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of Minimum Tuberculin Concentrations within dose-levels is much 
smaller than the between-dose variance. This implies that the constants 
y and 6 differ only slightly, if at all, among these subjects. If 6 is 
constant from subject to subject, it is clear from equation (8) that it 
is less than any + occurring. Therefore 6 < 10°"*’. The average of 


LOG 8B.C.G, ML. 


FIGURE 2 
Tue Errect or B.C.G. on THE Minimum TUBERCULIN CONCENTRATION 


log r~* is plotted against the log B. C. G.-dose in Figure 2. The least- 
squares straight-line has the equation 


log 7’ = 10.1 log B.C.G. + 12.9. (10) 


(95 percent confidence limits for the slope are 9.8, 10.4. A significantly 
better fit is possible by allowing a quadratic term but the improve- 
ment is only slight.) It is reasonable to extrapolate and state that 
higher B. C. G. doses would result in much lower 7’s than 10° 
T. U.’s/(mm)’. So for all subjects we will put 6 = 0 in equation (8) 
and write 


T= (11) 


Since y = ep and 6 = wp, the interpretation of 6 = 0 is that most of 
the tuberculin escapes without local reaction. 
As stated above, the variance of y from subject to subject must 
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be quite small. This is somewhat surprising since y is presumably 
related to skin-type. However, all 39°subjects were aged 18 or 19 
so that perhaps 7 is determined by age. The subjects of Table 4 throw 
some light on this. The relation of the area of fluid coverage constant x 
with age is as follows: 


= 1655, ages over 65; 

2490, age 55; 

3960, ages between 30 and 45; 
= 8408, ages under 30. 


K 
K 
K 
K 


The negative correlation is statistically significant (P = 0.05). There 
is a (non-significant) negative correlation of x and r. Accepting both 
correlations as real, there would be two possibilities consistent with 
the theory: (i) y is independent of age but both «x and \ decrease with 
age, the latter due to decreasing bacillary stimulation. In this case 
the determination of x by dye-injection has no relevance to the inter- 
pretation of the Mantoux reaction. (ii) y and x are correlated, both 
reing functions of skin-type. The data considered does not distinguish 
these two. One experimental approach would be to take negative 
reactors at different ages and induce possibly identical antibody levels 
by B. C. G. vaccination. 
Equations (10) and (11) combine to give 


log \ = 10.1 log B.C.G. + 12.9 + logy. 


The confidence interval for the slope includes the value 10 so that 
the results are consistent with equation (9) when vy = 10. 

(ii) Guinea-pigs. 

Each of five dose-levels of virulent Myc. Tub. bacilli was given 
by intraperitoneal injection to one of five pairs of guinea-pigs. The 
animals were tested with 100 T. U.’s in 0.1 ml. on the 10th, 18th, 27th 
and 35th days after injection. On one occ..sion each’ pair was also 
tested with 320 T. U.’s. The average diameters for each pair are 
given in Table 6 with the 320 T. U. results in brackets. The sinul- 
taneous diameters at 100 T. U. and 320 T. U. are consistent with equa- 
tion (1) with the same value of «* as for humans 126.7. For the dif- 
ferences Djo.o — Dig. are 56, 56, 61, 66 and 69 with an average of 62. 
By equation (1) the difference should be 64. If equations (7) and (11) 
also hold, the antibody levels, log (A/y), can be calculated. These are 
plotted in Figure 3. For these animals the bacillary content is increasing 
with time. If the increase is geometric, the actual bacillary content 
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* TABLE 6 


Tue INDURATION DIAMETERS PRODUCED BY DIFFERENT Dosis OF MycoBACTERIUM 
Tusercuosis (H 37 Rv) 1n Gurnga-P1Gs 


Dose of infecting organisms; Log mgms. 
Days wet weight of bacilli 
after - 
infection —1.796 —1.893 —1.990 —2.087 —2.184 

10 9 8 5 2 0 
18 10 9 7 5 (9) 0 
27 11 10 (12.5) 8.5 7 5 
35 12 (14.5) 11 10 (13) 8.5 7 (10.5) 


on the T'th day will be B,10*” where B, is the initial dose and k is the 
growth-rate constant. If equation (9) holds, then 


log (A/y) = v log By + vk log T + const. 


The least-squares estimates are § = 2.05, & = 0.0104 and const. = 3.72. 
The lines drawn in Figure 3 are based on these estimates. In view of 
the small number of observations a slope of 2.05 is consistent with a 
true value of » of 2. 
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If the models are valid, it must be concluded that, whereas at least 
10 B. C. G. bacilli are needed to activate a lymphocyte-producer in 
humans, only two virulent Myc. Tub. bacilli are required in guinea-pigs. 


DISCUSSION 


The conclusion of Section 1 of Results was that there exists a linear 
relation between the induration-area and the log-dose of tuberculin 
in constant volume. This contradicts the conclusion of Long and 
Miles [1950] with tuberculin and of Miles [1949] with diphtheria toxin 
that the linear relation is between diameter and log-dose. In Long 
and Miles [1950] it is stated that Wadley [1948, 1949] had found such 
a relation. However examination of Wadley’s paper shows that the 
bulk of the work established a linear relation between log-dose and the 
increase in thickness of-the induration produced by tuberculin injection 
in cattle. Miles [1949] rests the case for his linearity on visual inspection 
and analyses of variance. In his Table 1 there is a statistically signifi- 
cant departure from linearity but the associated F-value is small com- 
pared with that for the linear component. However, provided the 
range of reaction-diameters is not too large, this may also be expected 
to result from our equation (1). For example, with five doses at two- 
fold dilution intervals and the diameters ranging from 10 to 17.5 mms, 
the mean square for linearity would be several hundred times larger 
than the mean square for departures from linearity. In other words 
for this range the two theories are not clearly distinguishable. Miles 
observes, and it is apparent from his figures, that the departures from 
linearity tended to occur with small induration-diameters—“the curves 
‘dipping more steeply to the 3-5 mm values”. He invokes a special 
argument in explanation of this. However this kind of departure is 
precisely what our equation (1) would lead us to expect. 

In this paper an attempt has been made to present a validation of 
two models without considering whether they can be justified on more 
general grounds. A fuller discussion of their clinical and immunological 
implications will be published elsewhere. 

Further experimentation is clearly needed to deal with such questions 
as the following: 

(i) Are the constants or even the form of the dose-response relation 
of the intradermal test, dependent on experimental technique? 

(ii) Is the multi-hit model correct for bacilli other than BCG and 
the Mycobacterium Tuberculosis used? 


(iii) Is the approach adopted useful for diseases other than tu- 
berculosis? 
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OPTIMUM ESTIMATION OF GRADIENT DIRECTION 
IN STEEPEST ASCENT EXPERIMENTS 


SAMUEL H. Brooks anp M. Ray MIcKEyYy 
C-E-I-R, Inc. 
Los Angeles, California, U.S.A. 


Summary 


In the steepest ascent method for seeking maxima (Box and Wilson), 
the procedure involves the progression to a region of high response by 
successive inferences as to the gradient direction in sets of trials en 
route. When there is no experimental error, and the response surface 
is, well approximated by a plane in any locality, then it can be seen 
thac.an appropriately patterned set of trials, numbering just one more 
than the number of factors, would be required to completely charac- 
terize this plane and thus the gradient direction. Were there experi- 
mental error, however, the gradient direction would be correspondingly 
uncertain. One wonders whether to expect that more ground would 
be gained by having more trials in each set at the expense of having 
fewer sets. As this problem is formulated herein, there is a strong 
conclusion that the number of trials per set should be minimal. 


I. Introduction 


The procedure of steepest ascent (Box and Wilson, [1951]) consists 
of performing a sequence of sets of trials. Each set of trials is ‘‘centered”’ 
at a combination of factor levels, or point in factor space. An initial 
point is selected for the center of the first set of trials. Succeeding 
center points are determined by estimating the appropriate relative 
changes among the factor levels, i.e., the direction of the change, from 
the results of the set of trials just performed and scaling the increments 
of change such ‘that the new center point is a fixed distance, the step 
length, away from the old. This procedure is followed until a near 
stationary region is attained. 

This investigation is concerned with the allocation of effort to the 
trials in the initial stages of the search, where one is not likely to be 
close to a point of maximum response. It is assumed that the choice 
of scales for the factor levels has been decided upon, so that a distance 
is defined, and that a step length has been selected. The purpose of 
a set of trials is (and we here regard this as the sole purpose) then to 
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supply the direction of the “step” for determining the next center 
point. Box and Wilson [1951] have shown that the appropriate direction 
is that of the gradient at the new point. Consequently the purpose of 
a set of trials is to provide an estimate of the gradient direction. 

It is assumed that the response surface is well approximated by a 
plane in any locality. Then the gradient direction estimated as a result 
of a set of trials differs from that of the true gradient by some angle, 
say 6, when there is experimental error. The greater the number of 
trials in a set, the smaller @ is expected to be. Let us designate by S 
the size of step to be taken in the estimated gradient direction. As 
a result, S cos @ is the component of this step which is in the direction 
of the true gradient. If cos @ were near to one, the gradient would be 
well estimated and the step would be well taken. If cos @ were near 
zero, there would be comparatively little change in the response to 
be expected. If cos @ were negative, the step would be downhill. Thus, 
S cos @ may be regarded as the improvement in position in factor space 
as a result of a set of trials. 

Let us regard a trial as a unit of effort, so that for ¢ trials in a set, 
the improvement per unit of effort is (S cos @)/t. In planning a set 


of trials it is desirable to select ¢ so as to maximize the expected value 


of the improvement per unit effort, i.e. E[(S cos 6)/t]. This is equivalent 
to selecting ¢ so as to maximize (1/t) E(cos 6), where E(cos 6) is regarded 
as a function of t. 

Our main result is that the maximum is achieved when ¢ is one more 
than the number of factors; that is, it is not worthwhile to improve the 
precision of the estimate of the gradient direction at the expense of 
more trials. Since this result would ‘be anticipated if the experimental 
variation could be ignored, it is of interest that it applies independently 
of the magnitude of the variability. The result is convenient from the 
point of view of application, since otherwise the appropriate value of 
t would depend upon the magnitude of the gradient and upon the 
experimental variability, the ratio of which would havé to be estimated 
in advance of the set of trials. Consequently, the simplicity of the 
result enhances its value. 

The result is developed from the assumption that the trial design 
yields normally distributed uncorrelated estimates of the gradient com- 
ponents which have the same variance. The magnitude of the common 
variance will depend upon the design used. Although the present 
result does not evaluate the relative efficiency of alternative designs, 
the above formulation of the problem does provide a basis for com- 
parison by means of E(cos @). While we do not pursue this line of 
investigation here, some tabular values are given, so that alternative 


i 
| 
, 
i 
) 


50 BIOMETRICS, MARCH 1961 


designs can be readily compared from the point of view of efficiency 
for maxima seeking. 

The basic function E(cos @) is evaluated in Section II, and some 
numerical values are presented. The development is completed in 
Section III by applying the result of Section II to establish the main 
result of the paper. Some numerical values are presented of how the 
efficiency depends on the number of trials within a set. 

Brief mention should be made of related aspects of the maximum 
seeking problem that our formulation has not considered. The problem 
of recognizing that a near stationary region has been reached may 
present difficulties in the case of large experimental error. Another 
aspect is the effect of the curvature of the response surface both on the 
selection of the design of a set of trials and the size of the region over 
which the design is to be applied and consequently on the estimation 
of the gradient direction. Finally, we note that the prior selection of 
scales of the factor levels and step length also affect the estimate of the 
gradient direction. 

II. Derivation of E(cos @) 

Let us suppose that in the vicinity of a set of trials the relation 

between the response and the n factors is well represented by a plane: 


R = Bo + Bit + Bota + -+> + Bate +, 


where 2; is the level of the ith factor and « is the experimental error,’ 
NID(O, o?). 

Let us suppose, also, that as a result of a set of ¢ trials it is possible 
to construct an estimate b, of 8; such that the b; are NID(§, , 0”) and o” 
is a function of ¢. 

The angle between the true gradient 8, having components £8; , 
and the estimated gradient b, having components b, , is 6. Consequently: 


cos = >, >, B where 0< 

E(cos @) is to be found from the joint distribution of the magnitude 
and angle of error, 6 of the random vector b. It is possible to orient 
the factor space so that, without loss of generality, the components 
of 8 may be expressed, in units of c, as p, 0,0, --- ,0. Under the same 
transformation the components of b are y, , ¥2, °** ,Yn- The y; are 
NID with 


INID (0, ¢,%) means normally and independently distributed with mean sero and variance ¢ ,?. 
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and each with unit variance. The magnitude of b, in units of 5, is r. 
The joint distribution of r and @ is found from the joint distribution 
of the : 


Noting that the quantity in the brackets can be expressed by the 
cosine law, this function, in terms of @ and r, becomes: 


0) = 008 dr dH, 


where dH is the region over which r and @ are constant; that is, the 
surface ‘‘area’”’ of an (n — 1)-dimensional hypersphere of radius r sin 6. 
By integration over this region: 


1 — 1 
r(3)r ( 2 )e 


This provides an expression for the expected value of cos 0: 


dF(r, 0) = 


\ 


E(cos 6) = / cos 6 dF(r, 6). 
6=0 Jr=0 
Expanding e’* ©* @ in powers of rp cos @ leads to: 


2 2 


Each of the even j terms in the above expression are zero since, 
for even j, 


[ (cos 6)'*(sin 6)"-? dé = 0. 


For odd j this J integral may be put in the form of the beta function 
by letting X = sin’ 6: 


I=2 (cos 6)'**(sin de 


-[ er aX = 
r(2 +n— 
7 


The r integral evaluated as a gamma function becomes: 
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Assembling these terms and letting j = 27 + 1, there results: 


It may be noted that the above expression is closely related to 
the confluent hypergeometric function. 


An approximation to E(cos @) follows directly from the expression 
given for cos.@ by noting that 


E> = B; = pa’, 
E(> bi) = (n+ p’)o’. 
By substituting expected values for the random variables one obtains: 


E(cos) p/Vn + 


E(cos @) 


This approximation is in good agreement with the values in the table 


and illustrates the general nature of this function.’ 


III. Optimum Number of Trials in a Set 


In the previous section it was stated that p is in units of o and 
that o is a function of t. E(cos 9) is to be expressed as a function of 
t so that (1/t)E(cos 6) can be examined for the ¢ which maximizes this 
quantity; that is, the number of trials in a set which results in the 
greatest expected improvement per unit of effort. 

Since the §, are ordinarily estimated by means of linear functions 
of the observations, then o” would ordinarily be a simple multiple of 
o?. Were that set of trials replicated, say m times, then o” would be 
the same multiple of o?/m, which would be proportional to o7/t. Under 
these circumstances o* would be proportional to 1/t. It is suggested 
that this would be a good approximation of the functional relation of 
o and ¢ for more varied experimental designs than those which are 
replicated, as long as the trials would be made in the same locality. 
Consequently, since p’ is in units of o’, p° is proportional to ¢. It is 


‘It can be shown that a lower bound for E(cos @) is p/n + 92 + 1. 
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convenient to work with a quantity which is proportional to ¢ and 
therefore to p*. Let 7, k and h,(7’) be defined by the following: 


T = ki = ,’/2, 


n+2 
r(2+2 + if! 
As a result of differentiating this with respect to T: 


1 


(1/#)E(cos = h,(T) = ke” 


i)i 

It is apparent that for any positive n and any positive 7’, ¢ or p’, 
this derivative is negative. Consequently, (1/t) E(cos @) is a decreasing 
function of ¢, so that ¢ should be as small as possible. The minimum 
number of trials required to estimate a gradient by means of uncorrelated 
estimates of its components is one more than the number of factors. 
As a result, ¢ = n + 1 is the “optimum” number of trials to be used 
in each set of a maximum seeking experiment when such estimation is 
to be done. 

The simplex designs are very appropriate for this kind of experi- 
mentation. These and other useful first order designs are discussed 
in the reference by Box. In those cases where the factors can be assigned 


only discrete levels, the designs given by Plackett and Burman can 
be used. 


IV. Efficiency of Replication . 


It is of interest to see how efficient more than n + 1 trials in a set 
would be since other considerations may make it desirable to do as 
many trials as would be worthwhile within a single set. This would 
be the case, for example, if in some agricultural applications, many 
trials may be carried out simultaneously, but it takes an entire growing 
season to do any set of trials. 

As a notational convenience, the number of trials for which t = n + 1 
is designated as a single replicate, R = 1, andt = R(n + 1) isan Rth 
replicate. Let @ be the value of p when R = 1. Thus, G is the magni- 
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tude of the gradient relative to the standard error of estimation of a 
component when the number of trials used to do this estimation is 
n + 1.. For those experimental designs for which the standard error 
of component estimation is reduced in proportion to R~'”’, e.g. “pure” 
replication, there results the relation p = R'’G. The efficiency of 


TABLE 2 
Errictency oF R, tHE NuMBER OF REPLICATES, FOR n, THE NUMBER OF Factors 
AND G, THE MAGNITUDE OF THE GRADIENT RELATIVE TO THE 
STaNDARD ERROR OF EstiMATION WHEN 2? = 1 


The Number of Factors isn = 2 


Relative Magnitude of the Gradient 
Replicates, R | 0.25 0.5 | 1 2 
\ 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 
2 0.702 ; 0.686 | 0.621 | 0.550 | 0.510 | 0.500 
4 0.489 | 0.459 | 0.369 | 0.286 | 0.257 | 0.250 
8 0.334 | 0.292 | 0.203 | 0 146 | 0.129 | 0.125 
16 0.224 | 0.174 | 0 106 0.073 | 0.065 0.0625 
The Number of Factors isn = 8 
Relative Magnitude of the Gradient, @ 
Replicates,R | 0.25 | 05 
—- 
1 1.000 1.000 | 1.000 1.u00 1.000 1.000 
2 0.705 | 0.699 | 0.676 | 0.618 | 0.547 0.500 
4 0.495 | 0.482 | 0.411 0.358 | 0 287 0.250 
8 0.346 | 0.326 ; 0.272 | 0.196 | 0.149 0.125 
16 0.239 0.213 | 0.158 0.103 ) O76 0.0625 


The Number of Factors is n = 32 


Relative Magnitude of the (iradiernt, G 
Replicates, 0.25 0.5 | i 2 j 
1 1.000 | 1.000 | 1.000 | 1.0v0 000 1,000 
2 0.706 | 0.704 0.697 0.672, 9.614 | 0.500 
4 0.499 | 0.495 | 0.480 | 0.435 © 0.354 | 0.250 
8 0.351 | 0.345 | 0.322 | 0.267 9 9.196 | 0.125 
16 0.247 | 0.237 | 0.209 © v.154 | 0 105 | 0.0625 
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an Rth replicate for particular n and G is defined as the improvement 
per unit of effort expected as a result of the Rth replicate design as | 
compared with the improvement per unit of effort expected when R = | 


7 Blcos 0, p = VRG) 


E(cos @|n, p = G) 


Efficiency of (R|n,G) = 
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A STOCHASTIC STUDY OF THE LIFE TABLE 
AND !'TS APPLICATIONS 


Ill. THE FOLLOW-UP STUDY WITH THE CONSIDERATION 
OF COMPETING RISKS*” 


Lone CHIANG 
University of California, Berkeley, California, U.S.A. 


INTRODUCTION 


Statistical studies falling into the general category of lie testing 
and medical follow-up have as their common immediate objective the 
estimation of life expectation and survival rates for a defined population 
at risk. Usually such a study must be brought to a close before all 
the information on survival (of patients, electric bulbs, automobiles, 
etc:) is complete, and thus the study is said to be truncated. Whether 
the investigation is basically concerned with life testing or with medical 
follow-up, the nature of the problem is the same, although differences 
in sample size may call for different approaches. Thus methods devel- 
oped for life testing may be applied to follow-up studies when the under- 
lying conditions are met, and vice versa. In this study cancer survival 
data utilizing a large sample will be used as illustrative material, and 
we shall accordingly use the terminology of the medical follow-up study 
as a matter of convenience. 

We are concerned then with a typical follow-up study in which a 
group of individuals with some common morbidity experience are 
followed from a well-defined zero point, such as date of hospital admis- 
sion. Perhaps we wish to evaluate a certain therapeutic measure by 
comparing the expectation of life and survival rates of treated and 
untreated patients. Or we may wish to compare the expectation of 
life of treated and presumably cured patients with that of normal 
persons. When the period of observation is ended, there will usually 
remain a number of individuals on whom the mortality data in a typical 
study will be incomplete. Of first importance among these are he 


1This study was completed while the author was a Special Research Fellow of the National Heart 
Institute, Public Health Service, U. S. Department of Health, Education and Welfare. 

*Parte of this paper were presented at the joint meeting of the American Statistical Association, 
the Institute of Mathematical Statistics, and the Biometric Society in Washington, D. C., December 
29, 1959. 


57 


‘ 
a 
£ 
t 
~ 
| | 
7 
i 
& 
3 
: 
| 
vee 


gue 


58 BIOMETRICS, MARCH 1961 


persons still alive at the close of the study. Secondly, if the investi- 
gation is concerned with mortality from a specific cause, the necessary 
information is incomplete and unavailable for patients who died from 
other causes. In addition, there will usually be a third group of patients 
who were “‘lost’’ to the study because of follow-up failure. These three 
groups present a number of statistical problems in the estimation of 
the expectation of life and survival rates. Many significant studies 
have been made along these lines, among them the early works of 
Greenwood [13] and Karn [17], the actuarial method of Berkson and 
Gage [1], a stochastic mode! of competing risks by Fix and Neyman [12], 
the parametric studies by Berkson and Gage [2] and Boag [3], the 
non-parametric approach of Kaplan and Meier {16}, studies on life 
testing by Epstein and Soble |10] and other interesting works by Dorn 
3], Elveback [9], Fix [11], Harris, Meier and Tukey [14], and Littell [18]. 

The purpose of this paper is to adapt the biometric functions of 
the life table to the special conditions of the follow-up study. Part I 
considers the general type of study in which survival experience is 
investigated without specification as to the cause of death. An exact 
f-rmula will be presented for the inaximum likelihood estimator of 
tt probability of death and its asymptotic variance. Special attention 
wv. he given to a method for computing the observed expectation of 
life in truncated studies and the corresponding variance. In Part II 
the discussion will be extended to apply to studies of mortality from 
a specific cause in the presence of competing risks. The relations 
between net, crude, and partial crude probabilities will be reviewed 
and formulas developed for their estimators and the corresponding 
variances and covariances. In the last part of the paper, data obtained 
from the California State Department of Public Health will be used 
to illustrate the application of the theoretical matter presented in 
Parts I and IT. 

Throughout this paper we shall assume that all individuals in a 
sample are subject to the same force of mortality (or, instantaneous 
death probability), and that the probability of dying for one individual 
is not influenced by the death of any other individual in the group. 
This is to say that the life-times of all individuals in a group are treated 
as independent and identically distributed random variables. We shall 
also assume in Part I that there will be no individuals lost to observation 
because of follow-up failure. ‘The problem of lost cases will be con- 
sidered in Part II (Remark 3). 

For simplicity of presentation, a constant time interval (year) will 
be used. However, the methods developed in this paper apply equally 
well to cases where intervals are of different lengths; although the ob- 
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served expectation of life will have a slightly different form (Cf. [5] 
and [6]). 

The probability symbols used in this paper are listed below for 
convenient reference. Considering death without specification to cause: 


p. = Pr [an individual alive at time z will survive the interval 
(x, z + 1)], 


q. = Pr [an individual alive at time zx will die in the interval 
(z,x + 1)], 


and obviously p, + g. = 1. When death is studied by cause, or risk, 
we have the net probabilities: 


q:k = Pr [an individual alive at time z will die in the interval 
(x, x + 1) if risk R, is the only acting risk of death in the 
population], 


q:.2 = Pr [an individual alive at time zx will die in the interval 
(x, x + 1) if risk R, is eliminated from the ponulation]; 


the crude probability: 


Q., = Pr [an individual alive at time x will die from cause R, in 
the interval (z, x + 1), in the presence of all other risks 
in the population]; 


and the partial crude probabilities: 


Q.:.1 = Pr [an individual alive at time zx will die from cause R, 
in the interval (zx, x + 1), when only risk R, is eliminated 
from the population], 


Q.s.12 = Pr [an individual alive at time zx will die from cause RP, 
in the interval (x, x + 1), when risks R, and R, are elimi- 
nated from the population]. 


PART I. THE ESTIMATION OF THE PROBABILITY OF SURVIVAL 
AND EXPECTATION OF LIFE 


1.1. The basic random variables and their joint probability function. 


Consider a follow-up study conducted over a period of y years. A 
total of N, individuals are accepted into the study at any time prior 
to the closing date® and are observed until death or until the study is 
terminated, whichever comes first. If we set the time of entrance into 


*Although we have used the common closing date method to illustrate the techuiques developed 
in this paper, it should be pointed out that these techniques are equally applicable to the date of last 
reporting method. 
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the study as the common point of origin for all N, individuals, then 
N, is taken to be the number with which the study began, or the number 
of individuals alive at time zero. Let z be the exact number of years 
since entrance into the study, and N, the number of individuals who 
survive to the common point z. Clearly, N, may also be defined as 
the number of survivors who entered the study at least x years before 
its closing date. The number of survivors will decrease as z increases, 
not only because of deaths but also because of withdrawals due to the 
closing of the study. We will describe this process of depletion system- 
atically for the typical interval (z, x + 1) with reference to Table 1.* 

At time z, the N, survivors who begin the interval can be divided 
into two mutually exclusive groups according to their date of entrance 
into the study. A group of m patients entered the study more than 
x + 1 years before the closing date of the study. Out of these, 6 patients 
will die in the interval and s will survive to begin the next. interval. 
The second group of n patients entered the study less than zx + 1 
years before its termination, and hence are all counted as withdrawals 


TABLE 1 


DistarBuTION oF N, Patients AccorDING TO WITHDRAWAL Status, SURVIVAL 
Sratus, AND CausE OF DEATH IN THE INTERVAL (z, z + 1)* 


Withdrawal status in the interval 
Survival status and Total number Number not due | Number due for 
cause of death of patients for withdrawal** withdrawal*** 

Total Ne m n 
Survivors s+w 8 w 
Deaths, all causes D é € 
Cause of death 

R: Dz 6: 

R e D e 6 c € e 


*Yhe subscript z, which should be added to each of the symbols in the table. is deleted to simplify 
formulas in the text. 
**Patients admitted to the study more than (z + 1) years before closing date. 
***Patienta admitted to the study less than (z + 1) years but more than z years before closing 
date. 


(The second part of Table 1, death by cause, was included for use in Part II of this paper. 
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in the interval (z, z + 1), whether or not they survive, because for 
them the closing date precedes their (x + 1)-anniversary date. Let 
us say that « will die before the closing date and w will survive to with- 
draw alive. 

Thus s, 5, w, and « are the basic random variables (upper part of 
Table 1) whose distribution depends on the force of mortality. The 
values that these random variables take on will be used to estimate the 
probability p, that a patient will survive the interval (z, x + 1), and 
its complement g, , the probability of death in the interval. The first 
step is to derive the joint probability function of these random variables. 

Let u,, a function of time r, be the force of mortality acting on 
each individual in the study, such that 


u, Ar + o(Ar) = Pr [an individual alive at time 7 will die in the 
interval (7, r + Ar)], for 7 > 0, 


where Ar stands for an infinitesimal time interval and o(Ar) a quantity 
of smaller order of magnitude than Ar. It can be shown [5] that 


p(t) = exp My a.} 


is the probability that an individual alive at x will survive to (x + 2). 
If we assume a constant force of mortality within the interval (xz, z + 1), 
Say wu, = uw, depending only on z, for r < r+ S x -+ 1, then the proba- 
bility of surviving the interval is given by° 


Pz = @ 
For the subinterval (z, x + ¢), with ¢ between zero and one, we have 
p(t) = ps, for O<t <1. 


Consider first the group of m individuals, each of whom has a constan _ 
probability p, of surviving and a probability g, = 1 — p, of dying in 
the interval (x, z + 1). We have then a typical binomial case with 
the probability function: 


= pi(l -- p,)’. (1) 
The expected number of survivors and deaths are given, respectively, by 
E(s|m) = mp,, and E(5|m) = m(1 — p,). (2) 


The distribution of the random variables in the group due for with- 
drawal is not so straightforward. Making the assumption that, on 


‘When the assumption of a constant force of mortality is strong for an interval where the death 
rate is high, one may subdivide the interval and estimate the probability for each subinterval separately. 
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the average, each of the m individuals will withdraw at the point z + 3, 
the probability of withdrawing alive is equal to p}””, and the probability 
of dying before the time of withdrawal (1 — p}’ 2), Again we have a 
binomial case with the probability function® 


= ps (1 — (3) 
.. The expected number of survivors and deaths are given, respectively, by 
E(w | n) = npy”, and E(e|n) = n(1 — py”). (4) 


Since the N, individuals are divided at time z into two distinctly different 
groups according to their withdrawal status, the joint probability of 
all the random variables is the product of the two probability functions 
(1) and (3), 


-  thh= — — py’. (5) 
1.2. The maximum likelihood estimators and their asymptotic variances. 


We are now in a position to use the maximum likelihood principle 
to. obtain the estimator of the probability p; and its complement gq, , 
and the asymptotic variance. Taking the logarithm of the joint proba- 
bility function (5), we have the likelihood function 


L = (s+ + (© 
Differentiating (6) with respect to p, and setting the derivative equal 
to zero give the likelihood equation 

which implies 


a quadratic equation in ,”. Since #)* cannot take on negative values, 
we have the estimators, 


[= he + Vie + 4(N, + 
2(N, — 4n) 


(8) 


*A more plausible assumption perhaps is that a withdrawal takes place randomly throughout the 
interval. However, under this assumption the probability of withdrawing alive is 


dr = —(1 — ps)/inps, 


and the resulting maximum likelihood equation is too unwieldy. i 
not too low, the above expression and ps! ars close to each other. For ps = .70, for example, ps? = .837 


and —(1 — ps)/in ps = .840. In the couse of extremely high mortality, one should subdivide the 
interval. 
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and 


(9) 
The maximum likelihood estimator (8) is not unbiased; however, it is 
consistent in the sense that when the random variables s, w, and « are 
replaced with their respective expectations as given by (2) and (4), 
the resulting expression is identical with the probability p,. That is, 


— pt) + — pi)? + 4(N, — 3n)(mp,+ 


z 


2(N, 3n) 


To derive the formula for the asymptotic variance of the estimator 
p. , (or §,), we find the expectation of the second derivative of (6): 


where 
M,=m+n(1 +p,”)" (11) 
and 


According to the theorem on the asymptotic efficiency of an estimator, 


the asymptotic variance of #, (or §,) is given by the negative inverse 
of the expectation (10), 


o3, = 1/(M./(.4) + 7). (13) 


Usually the quantity x in the denominator of (13) will be small in 
comparison with the preceding term, and may be neglected to give the 
approximate formula 


o3, = p:9/M. . (14) 


The sampla variance of #, (or §,) is obtained by substituting (8) and 
(9) in (13)-or (14). 


Remark 1: The problem discussed here relates to the study in 
which N, is large. If N, is small, one may use the exact time of death 
of each of the D patients and the exact time of withdrawal of each of 
the w patients to estimate the probability p, . In this case, there will 
be N, individual observations within the interval (z, z + 1), and ob- 
viously it is unnecessary to consider the N, patients as two distinct 
groups according to their withdrawal status. Let ¢; < 1 be the time 
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of death within the interval (x, z + 1) of the i-th death, fori = 1, --- ,D, 
with a probability 


eu dt,, for i=1,---, D; 


let T; < 1 be the time of withdrawal alive of the j-th withdrawal, for 
j = 1, --- , w, with a probability 
for j= 1, +++, w; 


then the joint probability function of all the N, observations becomes 


D 
(e+ D 


P: dé;) I = (—Inp,)"p, Il dt, . 
Maximizing the last expression with respect to p, gives the maximum- 
likelihood estimator (cf. [18]), 


em| (++ + 


1.3. Observed expectation of life. 

A life table for the follow-up subjects can be readily constructed, 
once #, and @, have been determined from (8) and (9) for each interval 
of the study period. Let an arbitrary number |, denote the number 
of patients admitted to the study. The number |, who survive to the 
exact time z " computed from the formula l, = fof; --- f,-1 , and 
L,/lo = Pop: -+* Pe-: is the estimated z-year survival rate. For a patient 
alive at time z, the observed expectation of life can be expressed by 
the equation: 


where a, is the average time lived in the interval (z, z + 1) by the 
patients who die in that interval, and c, = 1 — a,_, + a,. If, ina 
study covering a period of y years, there are no survivors remaining 
from the patients who entered the study in its first year, f,-, will 
be zero, and é@, can be computed readily from the collected data. In 
the typical study, however, there will be w,_, survivors who entered 
the study in its first year and withdraw alive in the final interval 
(y — 1, y). In such cases, it is evident from (8) that f,_, is greater 
than zero and the values of 7, , #,., , --- are not observed within the 
limits of the study. Consequently, é, cannot be computed from equation 
(15). 
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It is nevertheless possible to estimate é, with a certain degree of 
accuracy if w,_, is small. Suppose we rewrite equation (15) in the 
form 


4, + + + + CyP:Pr+1 By-1 


+ (CysiDy + + °° ‘); (16) 


where 1, /l, is written for . The problem is to estimate 
the values of #, , fys: , --* in the last term, since the preceding terms 
can be computed from the data available. 

As a first approach, consider a typical interval (z, z + 1) beyond 
time y with the probability of surviving the interval: 


z+1 
p. = exp(—f dr) for z=y,yt+1,°°°. 


If the force of mortality is constant for z > y, the probability of survival 
is independent of z and we may write 


p,=e"*=p, for z=y,yt1,°°°. 


Under this assumption c, = 1, and we may replace the last term of 
(16) with (1,/l,) (6 + + ---), which converges to (l,/l,)p/(l — 
As a result, we have 


é=a,+ + + ++ 


Clearly, # may be set equal to #,-_, if the force of mortality is assumed 
to be constant beginning with time (y — 1) instead of time y. From 
the point of view of sample variation, however, it is desirable to base 
the estimate # on as large a sample size as possible. Suppose there 
exists a time 7’, for T < y, such that jr , prs: , --+ are approximately 
equal, thus indicating a constant force of mortality after time T. Then 
p may be set equal to #7 , and the formula for the observed expectation 
of life becomes (cf. [4]): 


forz = 0,---,y— 1. When 4g, is approximated with }, c, = 1. 
Although formula (17) holds for z = 0, --- , y — 1, it will be ap- 
parent to the reader that the smaller the z,.the larger the value of I, , 
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and the smaller will be the contribution of the last term. If the ratio 
1,/!, is small, the error in assuming a constant force of mortality beyond 
y and in the choice of #, will have but little effect on the value of @, . 


1.4. The sample variance of the observed expectation of life. 

To avoid confusion in notation let us denote by a a fixed number 
and consider the observed expectation of life ¢, as given in formula (17). 
It was proven in [5] that the estimated probabilities of surviving any 
two non-overlapping intervals have 8 zero covariance, and hence the 
sample variance of the observed expectation of life may be computed 
from 


Si. = (18) 


whert the derivatives are taken at the observed point #, , for z > a. 
Ta the present case, we have 


and 


= Parkers: + (1 — ar) + —fr)'}], for a < T, (20) 


= pr)’, for a> 
where = . Substituting (19) and (20) in (18) 
gives the sample variance of é, , 


for a < T, and 


Si. = + (1 a,)) Si, 
+ for a >T. (21a) 


The value of #, and the sample variance of §, are obtained from formulas 
(8) and (13), respectively. When a, is approximated with 3, the quantity 
(1 — a,) in formulas (21) and (21a) may be replaced by 4. 


PART II. CONSIDERATION OF COMPETING RISKS 


2.1. Relations between net, crude, and partial crude probabilities. 


In a follow-up study, as in general mortality analysis, one may be © 
interested in death due to a specific cause, or to a group of causes. 
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Depending upon the questions to be answered, the investigator may 
explore three general types of probabilities of death with respect to 
a specific cause, or risk: 


1. The crude probability. The probability of death from a specific 
cause in the presence of all other risks in a population. 

2. The net probability. The probability of death if a specific cause 
were the only cause in effect in the population or, conversely, 
the probability of death if a specific risk were eliminated from 
the population. 

3. The partial crude provability. The probability of death from a 
specific cause in the presence of all other risks but with a second 
risk eliminated from the population. 


Obviously, in the human population, the net and partial crude proba- 
bilities usually cannot be estimated directly except through their 
relations with the crude probability. The study of such relations is 
part of the problem of “competing risks”, or ‘“multiple-decrement”’. 
The subject has been variously discussed in the literature (see, for 
example, [12], [15], and [19]) and will be reviewed here only by way of 
introducing notation. 

Assume c risks of death (or causes) acting simultaneously on each 
individual of a population (that is, competing for the life of the indi- 
vidual), and let these risks be denoted by R, , --- , R.. For each 
risk there is a corresponding force of mortality, v,, , --- , vr. , each 
of which is a function of time 7, and the sum of these 


is then the total force of mortality. Within the time interval (2, x + 1), 
we shall assume a constant force of mortality for each risk, say 
Yer = , depending only on z and k, forz <r < 1. For all 
risks, we have un, = uw, ,forr <<7r< 

Let Q,.(¢) be the crude probability that an individual alive at time 
zx will die in the interval (x, x + for 0 < ¢ < 1, from cause R, in 
the presence of all other risks. It follows directly from addition and 
multiplication theorems that 


z+t 


The first factor of the integrand is the probability of survivjag from 
x to r when all risks of death are acting, while the second factor is 
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the instantaneous probability of death from cause R,. Integrating (23) 
gives 


= = [1 — p.(d)], 


for O<t<1; k = 1, ++-,¢. (24) 


It is clear from (22) that the sum of the crude probabilities in (24) 
is equal to the complement of p,(é), or 


Q.,(4) + eee + Q..(#) + p.(t) 1, for 0<? 3. (25) 


For ¢t = 1, we shall abbreviate Q.,(1) to Q,. , etc. For the purpose of 


this study we are particularly interested in the subinterval (x, z + 3), 
with 


= = Qall + for k=1,---,c. (26) 


In this case the sum of the crude probabilities (26) is the complement 
of p}”’, the probability of surviving half the interval, and 


+ + + + + pi? = 1. (27) 
When risk R, acts alone, the net probability that an individual 
alive at time z will die in the interval (z, zx + 1) is 
From formulas (24) and (28) we obtain the relation between the net 
and crude probabilities 


Qea/es 


By analogy we can write the net probability of death in the interval 
(x, x + 1) when risk R, is eliminated, 


Qs = 1—p, for k=1,---,¢. (30) 


Now suppose that R, is eliminated as a cause of death, and let 
Q..., be the partial crude probability that an individual alive at time 
zx will die in the interval (z, x + 1) from cause R, in the presence of all 
other risks, for k = 2, --- , c. Using a similar reasoning as in the 
crude probability [eq. (23)], we can write 


z+1 
—(r—z) 21) 
z 


Qu 


], for k= 2,---,e. (81) 


; 


69 


STUDY OF THE LIFE TABLE 


Similarly, if risks R, and R, are eliminated, the partial crude proba- 
bility that an individual alive at x will die in the interval (x, + + 1) 
from cause R, in the presence of all other causes is given by 


A detailed discussion on the partial crude probabilities is given in [7]. 


2.2. The basic random variables and their joint probability function. 


The identification of the random variables in a follow-up study 
in the presence of competing risks and the derivation of their joint 
probability function follows directly from the discussion in Section 1 
of Part I. The deaths in each of the two groups according to withdrawal 
status are further divided by cause of death as shown in Table 1, Part I. 

Each of the m individuals not due for withdrawal in the interval 
(x, x + 1) will fall into one of the c + 1 mutually exclusive groups, 
depending upon whether he survives the interval or dies from cause 
R, , -:: , R. , with the sum of the corresponding probabilities p, , 
Q.:,°** » Qee , equal to unity [eq. (25)]. Thus we have a multinomial 
case with the probability function 


fh = (33) 
where s is the number of survivors and 6, is the number of deaths from 
cause R, , for k = 1, --- , c. Their mathematical expectations are 
given respectively, by 
E(s | m) = mp,, and E(é,|m) = mQ,,, for k= 1, (34) 


Each individual in the group of n due for withdrawal in the interval 
(x, z + 1) has the probability p}’* of withdrawing alive and the proba- 
bility 

Q.(1 + for k= i, (26) 
of dying from cause R,. Since p}” and the probabilities in (26) again 
add up te unity [eq. (27)], the n observations also constitute a multi- 
nomial case with the probability function (cf. footnote 6) 


ec 


fo = pr” TT + (35) 


k=l 


where w is the number of individuals withdrawing alive and «, the 
number of individuals who die from cause R, before the time of with- 
drawal. The mathematical expectations are, respectively, 


E(w |n) = np}”, and E(a|n) = nQ.u(1 
for k= 1,---,¢. (36) 
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Because of the separation of the individuals into two distinct groups 
at time z according to their withdrawal status, the joint probability of 
all the random variables in Table 1 is the product of the two joint 
probabilities (33) and (35): 


fide = TT Ott TT + (37) 


Formula (37) may be simplified by rearranging terms and using the 
relations, D, = 6, + & , and « = ¢, + -+- + «,, to give the final form 
of the joint probability function 


hte = + II . (38) 

2.3. Mazximum-likelihood estimators of crude, net, and partial crude 
probabilities. 

We will again use the maximum likelihood principle to obtain the 


estimators of the probabilities: p, , Q., , --- , Q.- . In this case the 
likelihood function obtained from (38) is 


L = (8 + 4w) Inp, — In (1 + 


69 
where the substitution 


Qu (40) 


has been made. Differentiating the likelihood function (39) with respect 
to p. , Qs, -** , Q.. , respectively, and setting the derivatives equal 
to zero, we obtain a system of c simultaneous equations: 


aL /ap, = [(s + 4w)/p.) — + — (D,/Q.:) = 0, (418) 
(D,/Q.1) (D,/@.:) 0, for k= 2, (41 b) 
From (41b) it can be deduced that 
D./Ou = — for k = 1, (42) 
and therefore the ratio D,/Q,, in equation (41a) can be replaced with 
D/(1 — #.). When this substitution is made and the terms in (41a) 
are rearranged we have a quadratic equation in 7)” that is identical 
to equation (7) in Part I. Hence the estimators #, , in (8), 
and 4, for all causes of death will have the same value as in the simple 


case where death is investigated without specification to cause, as one 
would anticipate. Substituting (8) in (42) gives the estimators 
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= (D,/D)4. » for k=1,---,¢. (43) 


To obtain the estimators of the net and partial crude probabilities, 
we substitute (43) in formula (29), (30), (31), and (32), and after 
simplification, 

= for k =1,---,¢; (44) 

for k=1,---,¢; (45) 

Qa. = [Di/(D — — for k=2,---,c; (46) 
and 

= [Di/(D — D, — — for k = 3, «++, (47) 


The estimators given in formulas (44), (45), (46), and (47) are also 
maximum likelihood estimators because of the invariance property of 
maximum likelihood estimators. 


Remark 2: If there were no withdrawals in the interval (7, z + " 
i.e., if nm = 0, the problem is reduced to the classical multiple-decrement 
problem, with s survivors and D, = 6, deaths from cause R, , for 
k = 1, --- ,c. These random variables will still have a multinomial 
distribution [eq. (33)], and the formulas for the estimators of p, , @. , 
and Q,, are reduced to 


p: = 8/N, , (8a) 
4. = D/N, 5 (9a) 

and 
Q. = D,/N., for k=1,---,c. (43a) 


Formulas (44) through (47) may still be used for the estimators of 
the net and partial crude probabilities, but with #, given by (8a). 


Remark 3: The problem of cases lost to the study due to failure 
of follow-up is still unsolved, and perhaps it has no unique solution. 
Since the probability that a patient will be lost to follow-up is in part 
dependent upon the type of a study, assumptions with respect to lost 
cases may be valid for one study but not for another. If the number 
of lost cases is small. depending upon the type of study, one of the 
following assumptions may be made and the data handled accordingly: 
(1) patients lost will have the same probability of surviving as patients 
not lost, and may be deleted from the study; (2) all lost cases survive 
to the close of the study; (3) all die at the time of becoming lost; and 
(4) becoming lost is another competing risk. If sufficient knowledge 
of follow-up is unavailable, the fourth alternative is preferred. 
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2.4. Asymptotic variance and covariance of the estimators. 


Formulas for the variance and covariance of the estimators may 
be determined by using the asymptotic property of maximum-likeli- 
hood estimators. The inverse of the asymptotic covariance matrix 
of the estimators, , Q.2, --: , Q.. , is given by 


| 
1 X (ce — 1) 


L (e-—1)X1 


|| | 
- 


in which the elements are obtained by differentiating formula (41a) 
and (41b). Direct calculation gives the following mathematical ex- 
pectations 


—E(a°L/ap. 20.) M./Q: ’ for k= 2, (50) 
= '),, for h#k;h,k = 2, (51) 


and 
= M,{(1/Qn) + (1/Qx)], for k = 2,---,c¢, (52) 


where M, and = are defined by equation (11) and (12), respectively, 3 
of Part I. Substituting the respective expectations into (48), we have 


Pz zl 


Qe 


| 
j t 
a 
—E(0°L/ap*) M,{(1/p.) + (1/Q.)] + 2, (49) ( 
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with its determinant 


A= (M2/Qe Q.-Pz) + Q..)(1 = (54) 


Denoting the cofactors of the determinant by Ay, , forh, k = 1, --- ,¢, 
the formulas for the asymptotic variance and covariance are given by 


o3, = Ay,/A = (p,q./M,)[1/{1 + (55) 


for k=2,---,c, (56) 
= Au/A = + 
for k = 2, ---,¢, (57) 


and 


= A,,/A= + (xp./M,)}/{1 + 
for h,k = 2,3, ---,¢. (58) 


Since the term Q,, was not explicitly included in the likelihood function 
(39), the formulas for the variance of Q,, and the covariances between 
Q,, and other estimators were not presented. It is obvious by reason 
of symmetry, however, that expressions for Q,, do start from Q., , 
which is to say that formulas (56), (57), and (58) hold also for k = 1. 

The quantities inside the square brackets in formulas (55) through 
(58) may be approximated with unity when M, is moderately large. 


These formulas then reduce to familiar expressions of the multinomial 
case: 


= , (59) 
= Q.:)/M, for k= 1, (60) 
= —7p.Q../M, for k= (61) 


and 
= —QnQn/M., for h,k = 1,2, (62) 


Formulas for the asymptotic variance and covariance of the esti- 
mators of the net and partial crude probabilities can be obtained with 


the same approach as employed in [7]. To save space, only two formulas © 


are presented below. 


In (1 ges) en) + 68) 
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for k = 1, --- , c, for the net probability of death when risk R, is 
eliminated, and 


= — Qa — Qu)/{M.(q. — 
+ — — — Q1)}] 
— + Q.:p.(In (64) 
for k = 2, --- , c, for the partial crude probability. 


AN EXAMPLE OF LIFE TABLE CONSTRUCTION 
FOR THE FOLLOW-UP POPULATION 


The application of the methods developed in Parts I and II will 
be illustrated with data collected by the Tumor Registry of the Cali- 
fornia State Department of Public Health. The material selected 
consists of 5982 patients’ admitted to certain California hospitals and 


clinics between January 1, 1942, and December 31, 1954, with a diag-. 


nosis of cancer of the cervix uteri. For the purpose of this illustration, 
the latter date is taken as the common closing date of the study; the 
date of entrance to follow-up for each patient is the date of hospital 
admission. 

The first step is to construct a table similar to Table 2, showing the 
survival experience of the patients grouped according to their with- 
drawal status. The interval length selected (column 1) will depend 
upon the nature of the investigation; in this case a fixed length of one 
year was convenient and satisfactory. The total number of patients 
admitted to the study is entered as N, in the first line of column 2, 
which in this example is 5982. To determine their withdrawal status 
in the first interval (0, 1) the patients were separated into two groups: 
admissions before 1954, and consequently at least one year before the 
close of the study; and admissions during the year 1954, all due for 
withdrawal since the study was terminated before their first anniversary. 
Of the patients admitted prior to i954, s, (4030 in column 3) survived 
to their first anniversary and 6, (1287) died during the first year. The 
deaths. were further divided by cause into 49, (1105) deaths due to 
cancer of the cervix uteri and 5. (182) deaths from all other causes. 
The survival status of the 1954 admissions is determined at-the close 
of the study, as it is for patients due for withdrawal in any interval. 


In this study, w. (576) patients withdrew alive in the first interval, ~ 


and « (89) patients died before the closing date. These deaths were 
again divided by cause into ¢, (70, column 9) and «, (19, column 10). 


7An additional 251 cases of uncertain survival status were deleted from this illustration. 
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~ 
Ae 
4 
| 
| 
| 
‘tg 
f 
4 
f 
we 
4 


STUDY OF THE LIFE TABLE 


BIUIOJ IVDO Jo ‘GIVER jo ONIN], :eoInog 


0 0 0 0 0 0 0 
if if 08 € if 
4 if € 82 9 8ST 
if € ¢8 II OI-6 
I 9IT L PLP 6-8 
I 4 TOT II II PLY 099 8-2 
0 4 4 9T (a3 099 198 L-9 
908 9% 19 48 OLIT 
9 8 FI 6LE SIL. 1ST LIIZ 
ST 61 Lad ONG, LIIZ 
61 OL 68 92g SOTT L8ZT 2869 1-0 


19730 aq} Jo jo own 12710 Jo JO 
aoquin N qequin N qe 
Ut [VMBIPYFIM 10; onp you sequin 


PS6I-ZPGI ATIVILIN] 
XIAUZO AHL 40 UMONVD AO SIBONOVIG] ONIMOTIOY 


ATaVL 


| 
75 
= 
| 
'~o 
= 
| 
| | 
| 
| 
| wel | 
S 
| 
> 
l o 
~— 
3 
3 
3 
B 
r 
, foc) 
. ~ 
d 
e 
3. ~ 
N 
e 
l, ‘| 
° 
| 


BIOMETRICS, MARCH 1961 


TABLE 3 


SURVIVAL DiaGNosis oF CANCER OF THE CERVIX UTERI: 
Tre Marx Lire FuNncTIONS AND THEIR STANDARD 


Interval | 

since | Estimated probability Observed 

diaguosis | x-year survival rate of death in interval expectation 
(years; Doz (z,z + 1) of life at z 

i 
! 1000p | 1000S;,, 1000 4. 1000 és 
-| 

(1, (2) (3) (4) (5) (6) (7) 
o-1 | 1000.00 | 0.00 242.54 5.69 12.90 | 2.83 
757.46 | 5.80 181.43 6.26 15.86 | 3.74 
2-4 §20.03 6.65 103 .03 5.95 18.27 | 4.57 
(586.15 | 7.01 85.76 6.38 19.31 | 5.09 
508.46 7.38 64.13 6.50 20.08 | 5.56 
5-6 475.85 7.61 58.20 7.23 20.42 | 5.94 
G-7 448.15 | 7.95 43.76 7.34 20.65 | 6.31 
7-8 =| 428.54 | 8.29 43.20 8.45 20.57 | 6.60 
410.03 | 8.71 33.69 8.85 20.48 | 6.89 
9-10 | 396.22 | 9.17 46.55 12.15 20.17 | 7.13 
W-li 377.77 | 9.98 43.85 14.30 20.13 | 7.47 
12--12 361.21} 10.97 51.06 20.30 20.03 | 7.81 
12-13 | 342.77 | 12.78 0.00 00.00 20.08 | 7.79 
13 | 342.77 | 12.73 _ _ 19.08 | 7.79 


Source: California Tumor Registry, Department of Public Health, State of California. 


The second interval began with the 4030 survivors from the first interval, 
which 1s entered as N, in line.2 of column 2. All 1953 admissions in- 
cluded in .V, were due foi withdrawal in the second interval. 

The main life table functions and the corresponding sample standard 
errors as shown in Table 3 are determined from the data given in Table 2. 
The z-vear survival rate fo, is by definition equal to 1, divided by the 
radix J, , or Pop; , The sample variance of fo, is computed 
from a formula given in a previous publication [6] (see also [13]): 

z-1 
= 
Formulas (13) and (14) were both used to compute the sample standard 
error of g, , With numerical results that were almost identical to the 
fourth decimal place. The figures appearing in column 5 of Table 3 
were obtained by formnia (14). The observed expectation of life was 
determined from forrnula (17), for which f7 was set equal to pu . 
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TABLE 4 
Survival Experience AFTER Diacnosis OF CANCER OF THE CERVIX UTERI: 
EstiMaTep CRUDE AND NET PROBABILITIES OF DEATH From CANCER 
or THE Cervix UTERI AND FROM OTHER CAUSES 


Estimated crude Estimated net 
probabilities of death probabilities of death 
in interval in interval 
Estimated (z, z + 1) from (x, z + 1) when 
Interval probability 
since of surviving Cervix Cervix 
diagnosis interval Cervix Other cancer cancer 
(years) (z, z + 1) cancer causes | Acting alone | Eliminated 
z-z+1 1000 1000 Q.: | 1000 | 1000 1000 gz: 
(1) (2) (3) (4) (5) (6) 
0-1 757 .46 207.11 35.43 211.17 39.77 
1-2 818.57 155.97 25.46 158.11 2.71 
2-3 896.97 84.65 18.38 85.46 19.22 
3-4 914.24 62.89 22.87 63 .63 23 .63 
4-5 835.87 44.40 19.73 44.85 20.19 
5-6 941.80 25.76 32.44 26.19 32.87 
6-7 956.24 23.17 20.59 23.41 20.84 
7-8 956.80 22.47 20.73 22.70 20.97 
8-9 966 .31 14.44 19.25 14.58 19.39 
9-10 953.45 29.93 16.62 30.18 16.88 
10-11 956.15 24.36 19.49 24.60 19.73 
11-12 948.94 17.02 34.04 17.32 34.34 


Source: California Tumor Registry, Department of Public Health, State of California. 


Table 4 shows the estimated probability of surviving each interval 
and the estimated crude and net probabilities of death from FR, , cancer 
of the cervix uteri and R, , all other causes of death. Since only two 
risks are studied, the probability g.. is equal to q.., , the net proba- 
bility of death when cancer of the cervix uteri is eliminated as a risk 
of death from the population. For each interval the sum of #, , Q.: , 
and Q,, is unity, and the estimated net probability Q,, is always greater 
than the corresponding crude probability Q., . 
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THE SPEARMAN ESTIMATOR FOR 
SERIAL DILUTION ASSAYS’ 


A. JOHNSON AND Byron Brown, JR. 
School of Public Health, University of Minnesota, 
Minneapolis 14, Minnesota, U.S.A. 


SUMMARY 


A new estimation procedure is presented for the estimation of the 
density of organisms in a suspension by a serial dilution assay. The 
point estimator is based on the Spearman technique, is simply and 
quickly computed without special tables, and has efficiency 88 percent. 


1. INTRODUCTION 


To estimate the density of a specific organism in a suspension, a 
common method is to form a series of dilutions of the original sus- 
pension. Then from each dilution a specified volume, hereafter referred 
to as a dose, is placed in each of several tubes. Later the tubes are 
examined for evidence of growth of the organism. 

Suppose k + 1 dilutions are used with concentrations of z; = a ‘zo , 
t= 0, +1, +2, --- , +k. 2 is the highest concentration of original 
suspension used and a > 1 is the dilution factor. . 

The following notation will be used: 


x =Inz, 
n = the number of tubes receiving a unit volume (dose) for each 
dilution, 


r; = the number of tubes showing signs of growth at dilution z; , 


The usual model specifies a density of organisms of @ per unit volume 
in the original suspension and a Poisson distribution of organisms in 
the individual doses. Thus the probability of signs of growth (i.e., 
the probability of one or more organisms) in a tube receiving a dose 
at concentration z, is 


(1) 


The maximum likelihood estimator (frequently called ‘‘most prob- 
able number’’) has been proposed [5] for estimating 6. Computational 


1Research supported, in part, by a National Institute of Health Research Training Grant. 
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procedures and tables have been published to eliminate the tedious 
computations involved [1, 4, 7]. Cochran [2] has given some guides 
for designing serial dilution assays using the most probable number. 

Fisher [5] proposed another estimator for 6. Computation is tedious 
for this estimator also. Tables have been published [6] for some of 
the common experimental designs. 


2. POINT AND INTERVAL ESTIMATION PROCEDURES 


The dose levels on the log scale, zx = In z, are spaced d = In a units 
apart. The probability of a growth response at dose level z is 


F(z) =1-—e°". (2) 


The equidistant spacing and the “dose-response” function F(x) with 
the characteristics of a distribution function (increasing from 0 to 1 
with x) are typical of the common bioassay problem in which the object 
is to estimate the median (L.D. 50) or mean (ux) or F(z). For such 
situations, Spearman [9] proposed the following estimator for yu: 


(3) 
where x). = In z , d = In a, the summation is from i = 0 to t = k, and 


p; is the proportion of tubes showing growth at dilution z; . F(z) 
has mean: 


[ - In dy, 


= 


where y = .57722 is Duler’s constant [8]. The parameter of interest 6 
can then be written: 


(4) 


and the point estimator 6 of @ based on is’ 


The point estimator of the number of organisms per z, volume is 


*The point estimator 8 is biased. Its bias is discussed in Section 4 and a correction factor is given. 
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For convenience, the necessity of a logarithm table can be circumvented 
by constructing a graph of 26 as a function of )>p, , for commonly 
used values of a. Such a graph has been constructed ee 1) and 
its use is illustrated in the example in Section 6. 

If an interval estimate of 6 is desired, the neu: estimate of u 
is computed first and then the end points transformed by equation (5). 
The interval estimator of u (confidence coefficient of .95) is given in 
expression (7). The justification is presented in Section 3. 


The corresponding interva! estimator (C.C. = .95) for 6 is 
d In 2) 
n 


’ 


6 exp (— 1.96 < 6 < b exp (1.96 


where 6 is given in equation (5). 
3. CHARACTERISTICS OF 2p 


First the values of d, n and z, will be assumed fixed. Then he 
case of x» randomly chosen will be discussed. In all cases it is assumed 
that the dose levels xz; = In z; span an interval wide enough 30 that 
F(z.) > .99 and F(z, — kd) < .01. 

The following notation will be used to differentiate between aiscus- 
sions with x» fixed and discussions with x, random: 


E, denotes expectation with respect to the random variable: .. 
for fixed zp . 

E,, denotes expectation with respect to the distribution of +, when 
Z is randomly chosen. 

E denotes expectation with respect to both xz, and the random 
variables p,; . 


It follows from (2) and (3) that the expected value of , for fixed wp is 


=m +2 4 exp 
Using (4) and (9), the bias of a for fixed z , B(a | 2) +s 
Be |x) = 
= 2+ —d (1 — exp +74 in. 
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B(g | z) does not depend on n. It does depend on d and on x. For | 
a given value of d, the bias will have a maximum and a minimum over | 
all possible positions of z, , subject to the conditions that F(t) > .99 © 
and F(z, — kd) < .01. Let B(a | z.) be differentiated with respect to zp : 


= [BA | = 1 ¥ exp (xo — id) exp (1) 


For a given d, if enough terms are carried to cover the desired range, — 
it can be shown that (11) has two zeros with respect to x) on every | 
interval of length d. These solutions yield the maximum and minimum 


values of the bias over all positions x, for fixed d. Some results, obtained 
numerically, are: 


a = 4, d = In4: | B(a | zo) | S .0019, 
10, d = |n 10: | B(a | zo) | S .0417. 


(12) 


a 


Using (2) and (3), the variance of f for fixed z» , V(@ | x0), can be written 
x) = © — Fe). (13) 


The sum in (13) can be approximated by the following integral. 


The approximation will be good if d is small and the range of dose 
levels x; is large. 


Via FG) de = (14) 


n 
For a given value of d, V(a | zo) will depend on the position of zp . 


The maximum values of V(@ | z») can be obtained for given d by dif- 
ferentiating (13): 


(Va | 20) 
= exp (— [2 exp (— — 1]. (15) 


Zeros for (15) can be obtained numerically for any given d, using a 
range of doses large enough so that terms ignored in (15) are negligible. 
The values of x, thus obtained can be substituted in (13) to obtain the 
maximum and minimum variance. The results below are presented 
relative to the approximate variance (14): 
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din2 
din2 

n 

The numerical results (12) and (16) indicate that, if the range of 
doses is wide enough and the dilution factor a is 10 or less, then the 
following approximations can be used regardless of the position of 2» : 


Vala) 


n 

Suppose that z, is randomly chosen with uniform density over an 
interval of length d, say (A, A + d). Then the expectation of u can 
be taken with respect to x, and the p; : 


BQ) = | = | 20 


a= 10, d = In 10: 


< .0977. 


(17) 


A+d 


Dre) 


A+d 


(18) 
Integrating (18) by parts: 


A+d 


E(a) = az. (19) 


If the doses extend over a wide enough range, (19) indicates that 
fi has a negligible bias: 


= (20) 
For 2» randomly chosen on (A, A + d) the variance of f can be written: 
V(a) = E(a — = E,,[V(a | 20) + BA | 
A+d 


ve) =2 th ae. en 
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lt the doses extend over a range such that the first integral in (21) 
is approximated to the degree desired by integrating over the infinite 
range, }'(%) can be written 


The first integral has the value (d In 2)/n. It follows from (22) that 


| 


3 [max BY(A | 2.)}. (23) 


n 


From (23) and the numerical results in (12) the following bounds for 
the approximation of V(f) by (d In 2)/n can by obtained: 

din2 | 
n 


| va) 


d\in2 
n 


= 4, d = In4: 


< (3.76n)10°°, 


d 
| va — | 
10, d = In 10: din2 


n 
A summary of the results (20) and (24), for randomly chosen 2» , 


would be essentially the same as (17), i.e., if the doses extend over a 
wide enough range, 


(1.09n)10™. 


In concluding the remarks on the properties of @, it should be re- 
marked that the distribution of 2, conditional on zx» , is asymptotically 
normal, since it is a linear function (3) of the sum of k + 1 binomially 
distributed random variables. 


4. ESTIMATION OF 6 


The point estimator of @ based on f was defined in (5). The expected 
value of this estimator can be approximated for random choice of 2, 
as follows: 


E(6) = Ele"?~*] = 
= 6E fe 


S4 
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If the expectation is taken termwise in (26), terms in moments of order 


three and higher are dropped, and the approximation (25) is used, 
(26) can be written: 


E(6) = + (27) 


From (27) an estimator that is less biased than 6 is 


The coefficient of variation of 6, C.V. (6), can be approximated 
using (5) and (24): 


(29) 


The approximation to the variance of f, given in (17) and in (25), 
is identical with the asymptotic variance of the estimator for In 6 
proposed by Fisher [5]. Therefore the asymptotic efficiency of the two 
procedures for estimating @ will be the same, 88 percent. 

The interval estimator of @ given in (8) follows directly from (4), 
(17) and (25), and the asymptotic normality of A. 


5. DESIGN OF THE SERIAL DILUTION ASSAY . 


In choosing the range of dilutions, the conditions F(z) > .99 and 
F(x, — kd) < .01 should be satisfied. This implies that the expected 
number of organisms per dose at the highest and lowest concentrations 
should be greater than 5 and less than .01 respectively. 

In designing a serial dilution assay the precision desired of the 
estimator must be specified. This can be done by specifying the desired 
value of the coefficient of variation (29). Then equation (29) can be 
used to obtain n, the number of tubes per dilution, for any dilution 
constant a desired. 

Alternatively the precision of the estimator can be specified in 
terms of the interval estimator of 6. Using (8), the precision of the 
interval estimator for @ can be expressed in terms of the factor, R = 
1 which will be multiplied and divided into 6 to obtain 
the upper and lower limits of the 95 percent confidence interval. Graph 2 
shows the values of R as functions of the dilution factor a for values 
of n from one to ten. This graph can be used to design dilution assays 
having desired precision. See the example in Section 6. 
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6. EXAMPLE OF DESIGN AND ANALYSIS 


Suppose it is known that the density of organisms @ is between 
10° and 10°. organisms per unit volume. Suppose a ten-fold dilution 
factor is to be used. To assure doses spanning the range .01 to 5 
organisms per dose, it is necessary to use concentrations ranging from 
10~* to 10~’ dilutions of the original suspension, or 7 dilutions altogether. 

Suppose the value of @ is desired to within approximately four fold 
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TABLE I 
ILLUSTRATIVE Data 


Proportion of Tubes 


Dilution Showing Growth p 


107 
107 
10-3 
10-* 
10-5 
10-7 
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GRAPH 2 
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in either direction (i.e., the desired value of R is approximately 4). 
From Graph 2, n = 3 tubes per dilution will give R = 4.2 for a dilution 
factor of 10. This means a total sample size of 7n = 21 tubes. In 
Table 1 some data and computational results are presented for such 
a design. 

Using Graph 1, 2.6 + 80 organisms per dose at the highest concen- 
tration, 10~*. Therefore, 6 = 800 organisms per unit volume of the 
original suspension. Without Graph 1, 6 would be computed as follows: 


mi 10)(8) = —7.2915, 
6 = e~’~* = 824 organisms per unit volume of the original suspension. 
The interval estimate of 6 is obtained by using the factor R (= 4.2) 
read from Graph 2: 
824 


42 < @ < 824(4.2), 


196 < @ < 3460 (confidence coefficient of .95). 


The point estimate 6 can be corrected for bias using equation (28): 
2n 


6 = (.79)(824) = 651 organisms per unit volume. 


REFERENCES 


{1] Barkworth, H. and Irwin, J. O. [1938]. Distribution of Coliform Organisms in 
Milk and the Accuracy of the Presumptive Coliform Test. Jour. Hygiene 38, 
446-57. 

[2] Cochran, W. G. [1950]. Estimation of Bacterial Densities by Means of the 
“Most Probable Number’’. Biometrics 6, 105-16. 

[3] Finney, D. J. [1947]. The Principles of Biological Assay. Jour. Roy. Stat. 
Soc., Suppl. 9. 46-91. 

[4] Finney, D. J. [1952]. Statistical Methods in Biological Assay, Hafner, New York. 

(5) Fisher, R. A. [1922]. On the Mathematical Foundations of Theoretical Statistics. 

' Phil. Trans. Royal Soc. A222, 309-68. 

[6] Fisher, R. A. and Yates, F. [1957]. Statistical Tables for Biological Agricultural 
and Medical Research, (5th edition). Edinburgh: Oliver and Boyd. 

(7] Halvorson, H. O. and Ziegler, N. R. [1933]. Applications of Statistics to Problems 
in Bacteriology. I. A means of determining bacterial population by the dilution 
method. Jour. Bacteriology 25, 101-21. 

(8] Ryshik, I. M. and Gradstein, I. S. [1957]. Tables of Series, Products and Inte- 
grals. Veb. Deutscher Verlag der Wissenschaften, Berlin, 306. 

[9] Spearman, C. [1908]. The Method of ‘Right and Wrong Cases’ (‘Constant 
Stimuli’) without Gauss’s Formulae. Brit. Jour. Psych. 2, 227-42. 


=) 
ad 
Fy 
| 
ds 
4 


THE FITTING OF A GENERALIZATION OF 
THE LOGISTIC CURVE 


J. A. NELDER 
National Vegetable Research Station, Wellesbourne, England . 


This paper describes the fitting of the four-parameter family of 
curves defined by the differential equation. © 


Special cases were originally proposed by Piitter [1920] for various 
types of animal growth (see e.g. von Bertalanffy [1957]), and recently 
Richards [1959] has exemplified the general form of the curves and 
suggested that they may be useful for the empirical description of 
plant growth. For further details of the history of these curves and 
their mathematical properties reference should be made to Richards’ 
paper. It suffices to say here that the family defined by (1) includes 
as special cases several curves which have been used empirically for 
the description of growth, including the ‘monomolecular’ (diminishing 
returns) curve (@ = —1), the exponential curve (6 — 0 through positive 
values), the logistic curve (@ = 1), and the Gompertz curve (@ — © 
with A fixed and « a linear function of 6). 


MODEL AND NOTATION 


Incusing (1) in the description of growth, W will usually denote a 
weight of some kind, while ¢ is usually chronological time, but may be 
a suitable ‘time scale’ (Skellam et al. [1959], Nelder et al. [1960]) which 
can replace chronological time when the environment is variable. Since 
it seems to be characteristic of many growth phenomena that the 
relative growth rate (W~* dW/dt) is almost constant when the weight 
is small compared to the final weight, we shall restrict @ to be positive 
in what follows. For if @ < 0, then, for W sufficiently small, W~' dW /dt 
becomes as large as we please in absolute value. When @ > 0, the 
solution of (1) can be written in the form 


W + (2) 
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where X is the constant of integration. A and « are taken as positive 
since W is positive and is assumed to increase with time. Thus W”’ 
satisfies the logistic equation with upper asymptote A’ ast > @, 
while for ¢ large and negative 


Ww 
We now write Y = In W and a = In A so that 
Y =a— 0 In [1 + (3) 


where In denotes the natural logarithm. (3) is the form of the equation 
that will be used for fitting, and we shall assume that if y = In w is 
a sample value of Y = In W at time ¢, then E(y) = Y, and var y = o” 
independently of Y. We shall also assume that the y; (¢ = 1, --- , n) 
obtained at time ¢; , ie. the sample points to which the curve is to be 
fitted, are independent. The independence condition is satisfied for 
instance in plant growth analysis when a randomized field layout is 
used, and the sampling is destructive, i.e., a different set of plants is 
taken on each occasion. When the same set of plants is used on each 
occasion (when w might denote leaf area measured non-destructively) 
it is still possible to use the method proposed for estimation, though . 
of course sampling variances cannot be obtained from residual mean 
squares ‘within curves’ if the condition of independence is not satisfied. 
This situation is further discussed later in the paper. 

The assumption that y = In w has constant variance has been found 
to be reasonable from an examination of data on the weight of part 
or the whole of several crops. In fact it is usually found that the 
distribution of y does not differ significantly from a normal distribution. 

For the purposes of this paper, (2) will be described as the generalized 
logistic equation, though it is not of course the only generalization that 
could or has been made from the logistic equation. 


FITTING THE LOGISTIC CURVE 


We consider first the case where @ is known. This is equivalent 
to fitting the logistic equation, since by putting W’ = W”’, Y’ = Y/@, 
a’ = a/6, \’ = \/6 and x’ = «/@ we can convert (2) into the logistic 
function. We may, therefore, put @ = 1 for this case without loss of 
generality. Let us write 


= r + xt; and = = = 
l+e"% 


\ ‘ 
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then from (3) 
Y; — &), 
Thus the least-square equations are given by 
Ly — = 0, 
— Yok = 0, 
(y; — = 0. 
These equations have no explicit solution in general, so that they must 


be solved by iteration. The usual method, using the expected values 
of the information matrix (see e.g. Bailey [1951]), gives the following 


TABLE I 
EXTENSION OF BerKsON’s TABLE oF ANTI-Loaits [1/1 + 


344 
406 
462 
513 
559 
601 
639 
673 
704 
732 
758 
781 
801 
820 
837 
853 
867 
879 


891 
901 


To obtain £, from this table and Berkson’s table in Berkson [1953], use the iormula, — = antilogit 


(—r) = 1 — antilogit (7). 


91 
) 
3 
2 
r 0 1 2 3 4 5 6 7 + 9 
0.99 
5, 331 337 350 357 363 369 £376 382 
5] 394 400 412 418 423 429 435 440 446 
5. 451 457 467 473 478 483 488 493 498 
5. 503 «508 518 523 527 532 587 «541 546 
5] 550 555 564 «568 #44572 576 581 585 589 
5| 593-597 605 609 613 617 620 624 628 oo 
5, 632 635 642 646 649 653 656 660 663 5 
5, 667 670 676 680 683 686 689 692 695 : 
5] 698 701 707 713 £716 721 724 
5 727 730 735 737 743 £745 748 750 
6. 753-755 760 762 765 #767 772 774 
6 776 778 783 785 787 #789 791 793 795 
6 797 799 803 805 807 809 811 813 815 
6 817 819 822 824 826 827 829 831 832 
t 6 834 836 839 841 842 844 845 847 848 ‘ 
| 6 850 851 854 856 857 859 860 861 863 s 
6 864 865 868 869 871 872 873 875 876 
6 877 878 881 882 883 884 885 887 888 
f 6. 889 890 892 893 894 895 896 897 9898 
6] 899 900 902 903 904 905 906 907 908 


TABLE II 


VALUEs OF 104 In (1 + e77) 
For T positive —In (1 — —) = tabulated value. 
For 7 negative —In (1 — —) = |r| + tabulated value for |7]. 


4 


0.0 
0.1 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
1.2 
1.3 
1.4 
1.5 
1.6 
1.7 
1.8 
1.9 
2.0 
2.1 
2.2 
2.3 
2.4 
2.5 
2.6 
2.7 
2.8 
2.9 
3.0 
3.1 
3.2 
3.3 
3.4 
3.5 
3.6 
3.7 
3.8 
3.9 
4.0 
4.1 
4.2 
4.3 
4.4 
4.5 


pe 
0 1 2 3 4 5 6 
2 6931 6882 6832 6783 6733 6685 6636 6588 6539 6492 
. 6444 6397 6349 6303 6256 6210 6163 6118 6072 6027 
a 5981 5936 5892 5847 5803 5759 5716 5672 5629 5586 
5544 5501 «5459 «5417 «5375 «5334. «5293 «45252 «45211 «+5170 
re ; 5130 5090 5050 5011 4972 4932 4894 4855 4817 4779 
iy" : 4741 4703 4666 4629 4592 4555 4518 4482 4446 4410 
| par 4375 4340 4304 4270 4235 4201 4166 4132 4099 4065 
a 4032 3999 3966 3933 3901 3869 3837 3805 3773 3742 
ras 3711 3680 3649 3619 3589 3559 3529 3499 3470 3441 
" 3412 3383 3354 3326 3298 3270 3242 3214 3187 3160 
3133 3106 3079 3053 «3001: «2975 2949 2924-2898 
T a 2873 2848 2824 2799 2775 2751 2727 2703 2679 2656 
ee 2633 2610 2587 2564 2542 2519 2497 2475 2453 2432 | 
a 2410 2389 2368 2347 2326 2305 2285 2264 2244 2224 
jae 2204 2184 2165 2146 2126 2107 2088 2070 2051 2032 
ae 2014 1996 1978 1960 1942 1925 1907 1890 1873 1856 
ae 1839 1822 1806 1789 1773 1757 1741 1725 1709 1693 
ae 1678 1662 1647 1632 1617 1602 1588 1573 1558 1544 
“ae 1530 1516 1502 1488 1474 1460 1447 1433 1420 1407 
* 1394 1381 1368 1355 1343 1330 1318 1306 1293 1281 
=i 1269 1257 1246 1234 1222 1211 1200 1188 1177 1166 
= 1155 1144 1134 1123 1112 1102 1091 1081 1071 1061 
. 1051 1041 1031 1021 1012 1002 993 983 974 965 
hs 955 946 937 928 920 911 902 894 885 877 
A 868 860 852 844 836 828 820 812 804 796 
© 789 781 774 766 759 752 745 737 730 723 
7 716 710 703 696 689 683 676 670 663 657 
= 650 644 638 632 626 620 614 608 602 596 
590 585 579 573 568 562 557 551 546 541 
536 5530525) i520) 
"3 486 481 476 472 467 463 458 #454 449 445 
( 441 436 432 428 424 420 415 411 407 403 
a. 399 396 392 388 384 380 377 373 369 366 
362 359 352s 3483450 341 8335S 331 
328 325 «9322319 316 312, 3309S 3303S 300 
ia 297 295 292 289 286 283 280 278 275 272 
270 264 262) 257) 254 52249247 
244 242 239 237 235 232 230 228 226 «223 
hee 221 219 217 215 213 211 208 206 204 202 
tae 200 198 196 195 193 191 189 187 185 183 
181 180 178 176 174 173 171 169 168 166 
ae 164 163 161 160 158 156 155 153 152 150 
a 149 «147° 142140) 139-137 
135 133 132 131 130 128 127 126 124 «123 
122 121 120 118 «$117 116 114 112 
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TABLE II—(Gontinued) 


adjustments day , 5A , 5ko to initial ‘guesses’ a» , Ao , Ko for the param- 
eters:— 


n bh bay — Y,) 
De Lee | =| — YO (5) 


where £, is evaluated with A = A, and x = x). New values of &; and Y; 
are then recalculated using a, = a + da, etc. and further adjustments 
obtained, the process being repeated until whatever accuracy required is 
achieved. Note that the information matrix depends on the parameters 
only through 7; ; the process of solution of (5) would obviously be con- 
siderably speeded up if tables of ¢; and In (1 — &€;) as functions of 7; 
were available. 

Now & = antilogit (—r) = 1 — antilogit (7) in Berkson’s [1953] 
notation; hence ~ can be obtained for the range —4.99 < +r < 4.99 
from the table of antilogits given in that paper. An extension of this 
table for 5 < |r| < 6.99 which covers the region down to W/A = 0.001 
and up to W/A = 0.999 is given in Table I. Values of In (1 + e7’) 
to four decimal places are given in Table II for —6.99(0.01)6.99 from 
which In (1 — £) can be quickly derived. These tables have been 
computed from the tables of e* and In z given in Comrie [1949] and 
showd prove sufficiently accurate for most purposes. If linear inter- 
polation is used from the nearest tabulated value, using the tabulated 
first difference, rounding-off errors will rarely exceed 1 unit in the 
last place. 

Because Y;, is linear in a, knowledge of its approximate value is 
not necessary for the iterative process using the expected information 
matrix, and we can replace da, in (5) by a, , and Y by In (1 — 8), solving 
directly for a, , 5\o and 5xo . However when we come to consider the 
exact equations for the iteration, using the empirical information matrix 


4.60 100 5.10 61 5.60 37 6.10 22 6.60 14 | ; 
4.70 91 5.20 55 5.70 33 6.20 20 6.70 12 
; 4.80 82 5.30 50 5.80 30 6.30 18 6.80 11 
4.90 74 5.40 45 5.90 27 6.40 17 6.90 10 2 
5.00 67 5.50 41 6.00 25 6.50 15 
| 
| 
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instead of the expected one, we shall find no advantage in suppressing 
a , so that it will be retained in what follows. The calculations can 
be conveniently laid out as follows: 


TABLE III 
ComputinG Layout For THE Logistic CURVE 


(2) (3) (4) (5) (6) (7) 


t; = AF In (1 — &) Yi Ys 


The ¢; are entered in column 1, whence the 7; are calculated using the 
starting values of \ and x, and entered in column 2. Column 3 is filled 
up from Table I, and then column 4 calculated. Column 5 is filled up 
from Table II, and the y’s entered in column 6. a is estimated by 
i — > In (1 — &)/m from the sums of columns 5 and 6, and hence 
= y% — a@ — In (1 — &) is calculated. The coefficients on 
the left-hand side of (5) can now be calculated from columns 3 and 4, 
by summing the columns and from sums of squares and products of 
é and &,t; . Finally the right-hand side of (4) may be calculated 
from >> (y; — Y,), which should be zero, apart from rounding errors, . 
and from products of columns 3 and 4 with column 7. The equations (4) 
may now be solved using any of the standard techniques (see e.g., 
Dwyer [1951]). 


The exact iterative equations 
If we now denote the parameters to be fitted by @; , then the above 
method uses the expected information matrix whose general term is 


aY, aY, 
30, 


The exact equations for the generalized Newton-Raphson process are 
those using the empirical information matrix with general term 


oY, aY. ay, 
Cornfield and Mantel [1950] for the analogous situation in probit 
analysis found a substantial improvement in the speed of convergence 
using this latter form, so that it may be worth considering the ap- 
propriate modification here. This necessitates adding to the left-hand- 
side of (5) the matrix 
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0 0 0 


= — §) (yi — — 


and two more columns need to be added to Table III, namely 
(y — Y)(1 — &) and é” in order to compute the elements of this matrix. 
The extra calculation is not large but the increase in speed of convergence 
achieved may justify the extra labour only when oa’ is large so that the 
(y; — are appreciable. 

The dispersion matrix of the estimates of the parameters may be 
obtained in the usual way by inverting the information matrix on the 
left-hand side of (5) and then multiplying it by an estimate of o’. 
For methods of matrix inversion, and the combination of the inversion 
with the solution of an associated set of equations, see e.g., Dwyer [1951]. 

If a variance for A rather than @ is wanted, then we may use the 
approximate formula 


var A = vare* ~e** vara = A’ varé 


provided that var A is small compared to A’. 


Obtaining the starting values 


The starting values may be obtained in various ways, some wholly 
graphical and some partly so. Some compromise must be made between 
the desirability of having them as accurate as possible so that iterations 
are reduced to @ minimum, and the undesirability of spending an 
excessive amount of time on a preliminary stage of the computations. 

A semi-graphical method which I have found useful is as follows. 
First plot the logistic variable w'’’, henceforth denoted by 2, against t, 
and estimate by eye the value of A; call this estimate A’. Then plot 
In [x/(A’ — 2)] against ¢. If the logistic function is the correct one 
and A’ is-also correct, this should give, apart from random errors, a 
straight line. Now the variance of In [z/(A — z)] using the first-order 
approximation, is given by [A/(A — 2)]*o? = #0 in the notation 
already established. This variance increases as z increases, tending 
to infinity as x tends to A. Thus when the variance of z is constant, 
the graph of In [z/(A’ — 2)] against ¢ will show increasing scatter as 
x approaches A. In addition, if z has a symmetrical distribution, that 
of In [z/(A — x)] will be positively skewed for z near A and positive 
deviations will be in mean value much greater than negative ones. 
If A’ is too high, then the graph of In [x/(A’ — z)] against ¢ will show 
a tendency to level off at In [A/(A’ — A)], while if A’ is too low, the 
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graph will show an increasing slope as x approaches A. If the graph 
shows a sigmoid shape, the logistic equation is inappropriate. Assuming 
that no sigmoid shape has been found, we may obtain an improved 
estimate of A as follows. Divide the points into two sets, with the 
dividing line roughly where x = A’/2; then points with x < A’/2 are 
relatively slightly affected by errors in A, and have the smallest vari- 
ances. Fit by eye a straight line to the points on the (In [z/(A’ — z)], t)- 
graph using the points where zx < A’/2 and giving most weight to the 
smaller values of xz. Read off the fitted values of z/(A — zx) on this 
line corresponding to the values of ¢ for points having x > A’/2. If 
z is such a fitted value, then, ignoring errors, z = x/(A — 2) ahd so 
A = 2(1 + 2z)/z. Hence if we write uv = z(1 + z)/z, then wu is an esti- 
mate of A. To obtain an improved value for A, calculate for each 
point having z > A’/2 and use @, the simple mean of the u’s. The 
method is quite rapid in practice if a logarithmic graph paper is used, 
since then In [z/(A’ — z)] can be plotted directly from a knowledge 
of z/(A’ — x) and z can also be read off directly from the fitted line. 
If the improved value of A still gives curvature, the process may be 
repeated. Having obtained a good value for Aj , the starting value 
for A, we may now calculate \) and xo either by a graphical fitting 
of a straight line to the graph of In [z/(A, — x)] against ¢, using the 
relation In [X/(Ap — X)] = A + xt or by a weighted regression of 


In [z/(Ay — 2)] on ¢ using as weights & = [(Ao — 2)/Aol*. When 


TABLE IV 


CALCULATIONS TO OBTAIN STARTING VALUES FOR FITTING 
THE Logistic CURVE 


& 


u = 2(1+2)/z 2z/(73.2 — x) 


\ 
96 
at 
t 2/(71 — 2) 
—2.15 0.0529 0.0513 
Sey —1.50 0.0965 0.0934 
re. —0.85 0.155 0.150 
a —0.08 0.313 0.300 
A +0.52 0.510 0.503 
1.10 0.908 0.857 
2.28 2.38 71.9 2.16 
3.23 6.93 74.0 5.57 
= 4.00 41.8 76.5 18.0 
~ 4.65 17.2 71.1 11.0 
ie 5.00 41.8 72.4 18.0 
Mean 73.2 
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o’ is small, the fit by eye should be quite adequate, but the weighted 
regression may have some advantage when the scatter about the line 
is considerable. If logarithmic paper is used, it must be remembered 
that one cycle corresponds to log, 10 = 2.30259 units when the slope 
is to be measured. 


Worked example 


The data in Table IV relate to the growth of carrot tops in a field 
experiment, and were obtained by my colleague Mr. R. B. Austin. 
6 has been taken as known and equal to 2, so that z, the logistic variable, 
equals W”?; ¢ is a time scale based on total incoming radiation (negative 
values occur because an origin had been taken with a zero between 


0-6! 
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FIGURE 1 
MeEtuop oF OBTAINING STARTING VALUES FOR a, X, x, FROM Data OF TaBLE IV. 


@ Values of z/(71 — z) plotted on logarithmic scale against ¢. 
X Fitted values of Z = (z/A — =z) obtained from line drawn by eye through points having 
z < 35.5. 
O Values of z/(73.2 — z) plotted on logarithmic scale against ¢. 
[Note: t— scale displaced to avoid overlapping] 
~-- Line fitted by eye through points O. 
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the fourth and fifth reading; the reasons for this have no relevance for 
this example). Table IV also contains the preliminary calculations 
necessary to arrive et A, , the starting value for A. An initial plot 
of z against ¢ gave an A’, as judged by eye, of 71. x/(71 — x) was 
then calculated and recorded in the third column of Table IV. Figure 1 
shows the graph of In [z/(71 — z)] against ¢. From the solid line fitted 
by eye to the points having x < A’/2 = 35.5, the fitted values of 
z/(A — 2) for the last 5 points were read off, with u calculated for 
each, and recorded in the fourth column of Table IV. ‘The revised 
value of A’ is i = 73.2. Values of x/(73.2 — x) were then calculated 
(fifth column of Table IV) and plotted against ¢ (Figure 1); the broken 
line was fitted by eye, giving most weight to the small values of z, 
and gave A, = —1.109, x. = 0.861. The calculations for the exact 
iterative method discussed above can now be started. Filling up 
Table III we have the following: 


ty ; In (1 — &) 


—2.15 —2.960 —2.0440 —3.0105 
—1.50 —2.400 - —1.3752 —2.4868 
—0.85 —1.841 . —0.7336 — 1.9883 
—0.08 | —1.178 —0.0612 —1.4464 
+0.52 —0.661 +0.3429 —1.0773 
+1.10 —0.162 : 0.5944 —0.7774 
+2.28 +0.854 : 0.6808 —0.3547 
+3.23 +1.672 : 0.5110 —0.1722 
+4.00 +2.335 : 0.3532 —0.0924 
+4.65 +2.895 i 0.2437 —0.0538 
+5.00 +3.196 : 0.1965 —0.0401 


an wv 


SRESEE 


PP 
=) 


9 = 3.2391, (1 — &:)/n = —1.0454, ao = 4.2845, S.S. = 0.00661578. 


Adjustment equations are 
11.0000 5.3319 —1.2915 ive 0.0004 | 
3.9272 —3.0008 | dro —0.0124 |. 
8.0293 |! 5x —0.0020 | 


Solution is given by 
dao = 0.010025, 4.2940, 


= —0.021566, A, = —1.1300, 


5xo = —0.006746, 0.8544. 


—0.0020 
+0.0343 
a —0.0412 
—0.0101 
+0.0129 
+0.0157 
+0.0469 
—0.0247 
3 
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If a further iteration is required then a, should be estimated in 
the same way as a , i.e. using 7 — >> In (1 — £,)/n, and the value 
obtained above ignored. However the adjustments obtained from the 
first cycle are sufficiently small to justify our stopping in this case. 

The goodness of fit of the logistic curve can now be checked by 
comparing the residual mean square after fitting with the residual error 
obtained from replication. Using the estimates a, , A, , x, , we have 
a residual sum of squares after fitting of 0.00642 giving a mean square 
with 11 — 3 = 8 df. of 0.00080. The error mean square was 0.0033 
so that the fit is satisfactory. We may now obtain the dispersion 
matrix by inverting the information matrix and multiplying it by the 
variance of a single y; , as estimated by the error mean square. Thus 
the dispersion matrix is given by 


11.0000 5.3319 —1.2915 |" 0.00137 —0.00237 0.00066 
3.9272 —3.0008| = 0.00526 0.00159 


8.0293 0.00090. 


The standard errors of the estimates affect the second place of 


decimals, thus justifying the stopping of the iterative process after 
one cycle. 


0.0033 X 


FITTING THE GENERALIZED LOGISTIC CURVE 
If we now define 7, as (A + xt;)/0, then from (2) 
Y=a+t — 
where § = e'/(1 + gis as before, and 
The least-square equations, therefore, are given by 
=0, 


and the iterative solution from starting values ao , Xo , ko , 9 , gives 
adjustments da, etc. from the solution of 


& > B; ba | (yi — Y,) 
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TABLE V 
VaLuEs or —10*8, — 8 = r/(1 + — In (1 + e*) 


4 


0.0 
0.1 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
| 
1.2 
1.3 
1.4 
1.5 
1.6 
1.8 
1.9 
2.0 
2.1 
2.2 
2.3 
2.4 
2.5 
2.6 
2.7 
2.8 
2.9 
3.0 
3.1 
3.2 
3.3 
3.4 
3.5 
3.6 
3.7 
3.8 
3.9 
4.0 
4.1 
4.2 


100 
Z »| 0 1 2 3 4 5 6 7 8 9 
By: 6931 6931 6931 6930 6929 6928 6927 6925 6923 6921 
a 6919 6916 6913 6910 6907 6903 6900 6895 6891 6887 
" 6882 6877 6871 6866 6860 6854 6848 6841 6834 6827 
‘ 6820 6813 6805 6797 6789 6781 6772 6763 6754 6745 
6735 6726 6716 6706 6695 6685 6674 6663 6652 6640 
: 6628 6617 6605 6592 6580 6567 6554 6541 6528 6515 
. 6501 6487 6473 6459 6445 6430 6415 6400 6385 6370 
: 6355 6339 6323 6307 6291 6275 6258 6242 $225 6208 
°, 6191 6174 6157 6139 6122 6104 6086 6068 6050 6931 
; 6013 5994 5976 5957 5938 5919 5900 5881 5861 5842 
3 5822 5802 5782 5763 5743 5722 5702 5682 5662 5641 
z 5620 5600 5579 5558 5537 5516 5495 5474 5453 5432 
; 5411 5389 5368 5346 5325 5303 5281 5260 5238 5216 § 
5194 5172 5150 5128 5106 5084 5062 5040 5018 4996 | 
‘ 4974 4951 4929 4907 4885 4862 4840 4818 4795 4773 | 
“ 4751 4728 4706 4683 4661 4639 4616 4594 4571 4549 : 
"3 4527 4504 4482 4460 4437 4415 4393 4370 4348 4326 | 
— 4304 4282 4259 4237 4215 4193 4171 4149 4127 4105 | 
4083 4061 4039 4018 3996 3974 3952 3931 3909 3887 
“ 3866 3844 3823 3802 3780 3759 3738 3717 3695 3674 ) 
“ 3653 3632 3611 3591 3570 3549 3528 3508 3487 3467 
7 3446 3426 3406 3385 3365 3345 3325 3305 3285 3265 | 
2 3245 3226 3206 3186 3167 3147 3128 3109 3090 3070 
<i 3051 3032 3013 2994 2976 2957 2938 2920 2901 2883 
4 2864 2846 2828 2810 2792 2774 2756 2738 2721 2703 
2685 2668 2650 2633 2616 2599 2582 2565 2548 2531 | 
a 2514 2497 2481 2464 2448 2431 2415 2399 2383 2367 
? 2351 2335 2319 2303 2288 2272 2257 2241 2226 2211 
Fe 2195 2180 2165 2150 2136 2121 2106 2091 2077 2062 
2048 2034 2020 2005 1991 1977 1964 1950 1936 1922 
he 1909 1895 1882 1868 1855 1842 1829 1816 1803 1790 
3 1777 1764 1752 1739 1726 1714 1702 1689 1677 1665 
3 1653 1641 1629 1617 1605 1594 1582 1570 1559 1547 é 
1536 1525 1514 1502 1491 1480 1469 1459 1448 1437 é 
4 1426 1416 1405 1395 1384 1374 1364 1354 1344 1333 t 
3 1323 1314 1304 1294 1284 1274 1265 1255 1246 1236 
. 1227 1218 1208 1199 1190 1181 1172 1163 1154 1146 ( 
3 1137 1128 1120 1111 1102 1094 1086 1077 1069 1061 
; 1053 1045 1037 1029 1021 1013 1005 997 989 982 
- 974 967 959 952 944 937 930 922 915 908 i 
901 804 887 880 873 866 859 853 846 839 
i“ 833 826 820 813 807 800 794 788 782 775 6 
769 757 «+1751 745 «+739 «+733 716 a 
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TABLE V—(Continued) 


Little amendment is needed to Table III for the solution of these 
equations. A column for 6 must be inserted between columns 4 and 5, 
and 8 can be obtained rapidly from Table V. a» is estimated as 7 — 
6+ >. In (1 — &)/n, and ys — Yi = ys — a — 6 In (1 — &). All 
the elements in (7) can then be calculated. 


Obtaining the starting values 


A sign that @ is not equal to unity is the appearance of a trend 
in the values of wu during the graphical process for finding A, . If 
6 > 1, then values of u show a decreasing trend as w departs from A, 
and the lowest of them are often below the highest values of w. Con- 
versely if @ < 1, then u shows an increasing trend as w falls. According 


101 
¢ 0 1 2 3 4 5 6 7 x 9 as 
43| 710 705 699 694 688 682 677 672 666 661 ek. 
4.4| 656 650 645 640 635 630 625 620 615 610 ek 
45| 605 600 595 590 586 581 576 572 567 562 
4.6| 558 553 549 544 540 536 531 527 523 518 + 
4.7| 514 510 506 502 498 494 490 486 482 478 . f 
4.8| 474 470 466 462 458 455 451 447 444 440 : 
4.9| 436 433 429 426 422 419 415 412 40% 405 a 
5.0| 402 398 395 392 389 386 382 379 376 373 
5.1| 370 367 364 361 358 355 352 349 346 343 
5.2| 340 337 335 332 329 326 324 321 318 316 
5.3| 313 310 308 305 303 300 298 295 293 290 
5.4| 288 285 283 281 278 276 274 271 269 267 
5.5| 265 262 260 258 256 254 252 249 247 245 7 
5.6| 243 241 239 237 235 233 231 229 227 225 
5.7| 223 222 220 218 216 214 212 211 209 207 
5.8| 205 204 202 200 198 197 195 193 192 190 . 
188 187 185 184 182 181 179 178 176 175 
6.0| 173 172 170 169 167 166 164 163 162 160 : 
6.1) 159 158 156 155 154 152 151 150 148 147 
6.2} 146 145 143 142 141 140) 138 137 136 135 
134 133 132 130 129 128 127 126 125 124 
6.4} 123 122 121 120 119 118 116 116 114 114 = 
113 112 110 109 108 107 106 
6.6| 103 102 101 101 100 99 98 97 96 95 
6.7 95 94 93 92 91 91 90 89 88 88 : 
6.8 87 8 8 8 8 83 82 82 81 80 a 
6.9 80 79 78 77 7% ©6755; 74 73 
7.0 73 72 72 71 70 70 69 69 68 = 67 Pi 
4 
3 
) 
5 
6 
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to the trend shown by the w’s, @ can be changed in the appropriate 
direction, and when 6, has been obtained, with the corresponding A’, 
we can use the relation 


1 
to get estimate of \./@ and x./@ and hence of ) and x. For the 
calculations of coefficients in (7), however, only ¢ and 7 are needed. 


Worked example 


In the example above of fitting the logistic curve to W'”, we were 
in fact fitting the generalized logistic curve taking @ as known and 
equal to 2. We now consider the fitting when @ is not assumed known, 
using the value 2 as a starting point. Columns 1-5 of Table III are 
unchanged but y in column 6 is now twice the previous value since we 
were working before with In z'’? = }Inz. Column 7 will also be doubled 
and a column of 8; must be added. The revised and added columns 
are given together with the iterative equations in Table VI. 


TABLE VI 


REVISED AND ADDITIONAL COLUMNS FOR FITTING THE GENERALIZED 
Loaistic CuRVE 


2.544 —0.0038 
3.664 +0.0688 
4.510 —0.0824 
5.656 —0.0200 
6.398 —0.0162 
7.040 +0.0252 
7.824 +0.0356 
8.256 +0.0314 
8.478 
8.412 

8.478 


Adjustment equations are 
11.0000 5.3319 —1.2915 --4.4747 
3.9273 —3.0008 —2.3015 
8.0293 —0.3343 
2.1857 


4 
= 
4 
—0.1964 
—0.2865 
—0.3992 
—0.5457 
—0.6414 
—6.6895 
—0.6096 
—0.4366 
—0.2987 
—0.2054 
—0.1657 
0.0016 
—0.0244 
_ 
~0.0048 
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The solution of these equations is given by 


ba, = —0.016240, 
5A. = —0.201431, 
5xo = —0.089091, 
50, = —0.254585, 


whence 
+ = 2(—1.109) — 0.2014 = —2.4194, 
«= a(*3) + 5x, = 2(0.861) — 0.0891 = +1.6329, 
6, = 2 — 0.2546 = 1.7454. 


a, is not given because it is better estimated from g — @, >> In (1 — £;)/n 
in the next iteration, as described above. 

The rather large and approximately equal adjustments to A, and 4 
should be noted; the estimates of these parameters are in fact highly 
correlated and have relatively large variances; hence the reduction of 
the S. S. of residuals (from 0.0265 to 0.0239) following the adjustment 
is rather small. 

A second iteration using the values a, , A; , K, , & gives 


bc, = —0.00446770, ‘a = 8.5538, 
= +0.02565976, = 2.3937, 
5x, = +0.00782844, ks = 1.6407, 
50, = +0.02010622, 6, = 1.7655. 
and a third and final iteration gives 
a, = 8.5539, 
ds = 2.3917, 
ks = 1.6415, 
6, = 1.7676 
with dispersion matrix 


0.01419 0.02379 0.01526 0.06075 | 
0.13220 0.06751 0.21618 | 
0.03705 0.11775 
0.40847 
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In comparing the estimates and their variances and covariances with 
those obtained for @ assumed known and equal to 2, it should be remem- 
bered that the effective parameters estimated in the latter case were 
a/3 = a/2 etc. Thus parameter values for the logistic case should 
be multiplied by 2 and their variances and covariances by 4 for com- 
parison with those of the four-parameter curve. As was to be expected, 
the variances of @, \, and & have been considerably increased by the 
inclusion of @, also 6 itself is poorly determined with a standard error 
of +0.639, so that it does not differ significantly from the value of 2 
originally used. 


ALLOCATION OF SAMPLE POINTS 


This section is concerned with the effect on the efficiency of esti- 
mation of different allocations of position of sample points on the curve. 
Obviously points on the curve where z”* is small compared to A’”’ 
give almost no information about A, while points where x’ is close 
to A”® give almost no information about \ and x, the parameters 
defining the exponential part of growth. To reduce the range of pos- 
sibilities of the distribution of the sample points to manageable pro- 
portions, attention will be confined to samples at points equally spaced 

_ with regard to ¢ and symmetrically arranged about r = 0. [In practice, 
a rough idea of the values of the parameters will enable observations 
to be taken at times similar to those considered here]. The points of 
the arrangements considered are spaced along the ¢ scale in units of +r 
or 37, while the ranges spanned are 6r, 107, and 147. Table VII gives 
the relative sampling variances per point for the parameters, and also 
the generalized information per point using | Q |/n where n is the 
number of points and Q the information matrix for the samples. Two 
cases are considered: §@ known (3 parameters to be estimated) and 
6 unknown (4 parameters to be estimated). 

The table shows that more intensive sampling of a given-range 
[e.g. —3(4)3 instead of —3(1)3] slightly increases the sampling variance 
per point while increasing the information per point. The increased 
information implies that some or all of the correlations between the 
estimates have been reduced by the increased density of points, although 

the arrangement 3(4)3 is less efficient than 3(1)3 when measured in 

terms of sampling variances only. Increasing the range reduces the 
sampling variances per point as one might expect, and so yields more 
information than more intensive sampling of the same range. The 
sampling variances of &@, \, and & are considerably increased when 

6 is unknown, particularly when the range of 7 is small. This is of 

course a reflection of the fact that curves for neighbouring values of 
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SAMPLING VARIANCES PER PoINT OF THE PARAMETER ESTIMATES 
AND THE GENERALIZED INFORMATION PER PoINT 
FOR VARIOUS DISTRIBUTIONS OF THE OBSERVATIONS 


6 Known, a, A, x UNKNOWN, o? = 1 


TABLE VII 


of 


Allocation 


Sampling variance 


per point Generalized 


information per 
point 


—3(1)3 
—5(1)5 
—7(1)7 
—3(4)3 
—5($)5 


5.39 39.2 3.60 1.57 
3.11 23.1 1.14 16.93 
2.59 18.5 0.56 73.16 
6.05 42.9 4.21 4.32 
3.22 23.8 1.25 54.80 
2.63 18.8 0.60 252.6 


a, \, x, @ UNKNOWN, o? = 1 


Allocation 
of r 


Sampling variance 


per point Generalized 
information per 
é 6 point 


—3(1)3 
—5(1)5 
—7(1)7 
—3(4)3 
—5(4)5 


14.26 361.2 41.50 339.3 0.0324 
4.99 84.0 5.28 89.4 2.008 
3.50 53.4 1.62 54.4 20.16 
17.20 420.8 51.57 398.4 0.2655 
5.34 91.5 6.13 96.3 11.95 
3.60 45.5 1.82 56.2 130.4 


6 can be made to agree very closely over this range of 7 by suitable 
changes in the other parameters. \ and 6 have the largest sampling 
variances in all the arrangements and hence are the most difficult. tu 
estimate accurately. For instance, if an individual value of w, in an 
experiment had a percentage standard error of 5, then var y = var 
(In w) ~ 0.0025, so that with the allocation —3(1)3 we should havi 
var 6 = 0.0025 X 48.47, whence s.e. 6 = +0.35. Since @ is often in 
the range 1-2, the allocation 7 = 
determine this parameter at all accurately. Similar considerations 
apply to }. By such arguments one can find what range w must cover 
if adequate estimates of the parameters are to be obtained for 


—3(1)3 would be inadequate to 


W = +e’) 
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so that r = In [(W"/(A”’ — W'”")] = logit (W/A)'”. Thus if 6 is, 
say, equal to 2 then r = 3 corresponds to (W/A)’”* = 0.95257 whence 
W/A = 0.976; similarly r = —3 gives W/A = 0.024. Thus when 
6 = 2 the range r = —3 to 3 implies that W ranges from 2.4 to 97.6 
percent of A, and the argument above shows that this range will often 
be insufficient for adequate estimates of some of the parameters. Note 
that the larger the value of @ the greater the range of W/A required 
to cover the same range of r. 


DISCUSSION 


This paper is not the place for a discussion of the place of mathe-° 
matical functions in the description and analysis of growth. A great 
deal has been written on the subject both for and against the utility 
of this approach, and it suffices to say that I assume in this paper that 
differential equations and their associated integrals have some use- 
fulness, and that methods are required for the efficient estimation of 
the parameters. For an approach to the problem of interpreting 
growth curves which involves replacing chronological time ¢ by some 
function of meteorological observations see Nelder et al. [1960]. [In 
the example above ¢ is in fact a scale based on total incoming radiation]. 

The method of estimation used in this paper is based on three 
assumptions: (i) that the w, are independent, (ii) that E(y,) = Y, 
and (iii) that var y; is independent of Y;. It may reasonably be asked 
how suitable the procedure is likely to be if some or all of the assumptions 
are not true, and this involves us in an analysis of the errors in the Y, . 
Errors may be due to variation in the parameters from one individual 
to another, for either genetic or non-genetic reasons; or they may arise 
from errors in ¢, , caused by using the wrong time-scale. In addition 
we have the possibility of systematic errors arising from deviations 
from the form of the curve actually used in the fitting. It will certainly 
not be exactly true that if, for instance, x has a normal distribution in 
the population of individuals whose growth is being studied, then random 
sampling will enable us to set E(y,;) = Y; , where Y, is given by (3) 
with the mean of « used in the formula. But such biases will not be 
serious if the variance of « is small, i.e. if we control the genetic and 
environmental uniformity of the population well. If the w, are from 
the same experimental units throughout, which is the commonest 
cause of non-independence, the method will remain satisfactory if 
var (y,;) remains constant, where the variance is now measured within 
individual growth curves. However sampling variances of the esti- 
mates of parameters must now be calculated from the variation in the 
parameter values between individuals. If the variation between indi- 
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vidual curves is much greater than the variation within them, then the 
efficiency of the fitting method becomes correspondingly less important, 
and even graphical methods might suffice. A frequent cause of devia- 
tions of var (y,) from constancy is the occurrence of constant-variance 
weighing errors on the w scale. On the log. scale these become more 
serious as the weight falls, and lead to var (y,) increasing rapidly at 
very low values of w, where the weights are not large compared to the 
weighing errors. Difficulties in measuring very small distances on a 
plant cause the same effect. In practice the change in var (y) can often 
be approximately eliminated by a suitable increase in replication at 
very low levels of w. 

The model employed here for fitting the logistic curve may be con- 
trasted with a method suggested by Stevens [1951], who proposed 
writing it in the form 


1 = 
wo (1 + Be 


and using his method for fitting curves of the type 
z=a+tBp', with and p=e. 


Since he gave equal weight to his z’s in the fitting, this is equivalent to 
assuming that 1/w has constant variance and hence that var (In w) « WwW’ 
to the first order. Though conditions may exist where var (In w) « W’, 
in my experience it is usually much closer to the truth to take var (In w) 
constant. Several papers concerned with fitting the curve z = a + Bp’ 
have appeared (e.g. Nair [1954], Patterson [1956], Finney [1958], Patter- 
son and Lipton [1959]), but these are almost entirely concerned with 
the case where z has constant variance and where the ¢,; are equally 
spaced. Where time scales based on meteorological measurements (such 
as day-degrees) are used for field crops, it is usually impossible or 
impracticable to arrange for equally spaced ¢’s, so that methods used 
in the above mentioned papers are no longer available, even if it could 
be assumed that var z was constant. In addition, none of the methods 
is immediately applicable to the general case where @ has to be estimated. 

The reasons behind the particular choice of the parameters in (3) 
should perhaps be mentioned. There are, of course, an infinite number 
of equivalent ways of writing (3), in so far as the structural parameters 
are concerned. The particular form used has been chosen to reduce 
the computing labour of the iterative solution as much as possible, 
while retaining a set of parameters with fairly clear meanings as far 
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as growth is concerned. _Thus a = In (asymptotic value of W), while 
d and « define the exponential growth stage for large negative t when 
W ~ Ae***. A possible interpretation of 6 for a field crop in terms 
of the spatial distribution of plant material has been given by Nelder 
et al. [1960]. The form used may be capable of further improvement, 
especially from the point of view of the convergence of the iterative 
process. It is an interesting point to what extent approximately 
orthogonal parameters can be found for the general non-linear function, 
that is parameters such that the cross terms in the dispersion matrix 
are small compared to the diagonal terms, and whether the use of such 
parameters would speed the convergence of the iterative process. While 
a linear transformation of the parameters leads to equivalent iterative 
equations, non-linear transformations in general do not, and possibly 
the speed of convergence would be affected by the particular specifi- 
cation of the parameters. 

A natural extension of the generalized logistic equation as defined 
here is the five-parameter equation dW/dt = aW* — BW’ discussed by 
von Bertalanffy [1957]. The estimation of the parameters of this 
equation, subject to the same assumptions as those considered above, 
is greatly complicated by the lack of any general explicit solution of 
_ the equation. Consequently expected values would have to be computed 
by numerical integration which makes the labour involved excessive 
except to those having electronic computers available. A general 
method for the fitting of differential equations has been given by Box 
[1956], and this could be applied to the general equation. However 
we may expect that the introduction of the fifth parameter would 
entail the existence of very extensive data if reasonably accurate esti- 
mates of the parameters were to be obtained. The data discussed above 
cover a 380-fold range of weights, and the standard error of a single 
weight was about 5.5 percent, yet the s.e. of 6 was as high as +0.64. 
The introduction of the fifth parameter might well require a much 
larger range of weights to give reasonably accurate estimates. One 
consequence of this situation is that it becomes difficult to test models 
based on a priori values of the exponents ~ and », unless very extensive 
data are available, since the fits of curves having ¢ and » differing quite 
widely from the a priori values would be hardly distinguishable. 


SUMMARY 
The least-squares fit of the curves defined by 
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is derived for the case when the sample values of In W are independent, 
unbiased, and of constant variance. 

Tables are provided to assist the computing of the iterative process 
used for estimation, and a method is given for obtaining starting values 
for the parameters. 

Evaluation of the sampling variances of the estimates of the param- 
eters and the generalized information shows that the range which the 
observations cover is more important than the density of sample points 
within that range if minimum sampling variance or maximum infor- 
mation per point is required. The introduction of @ as an unknown 
increases the sampling variances of the other parameters, the increase 
being especially marked when the range of (W/A)”° is small. 

Some possible sources and consequences of deviations from the 
assumptions underlying the fitting are discussed, and also the diffi- 
culties involved in extending the fitting to the curves defined by 


dW /dt = aW* — pW’. 


It is stressed that sensitive tests of hypotheses involving a prior’ 
values of parameters may demand very extensive data. 
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COMBINED ANALYSIS OF BALANCED 
INCOMPLETE BLOCK DESIGNS 
WITH SOME COMMON TREATMENTS 


M. V. PavaTE 


Central Tobacco Research Institute 
Rajahmundry, India 


INTRODUCTION 


On many occasions, research workers have to lay out their experi- 
ments at different places with one or more treatments common to 
each experiment. The reason may be the lack of sufficient space at 
one location as in field experiments, or as in industrial problems the 
availability of only a limited number of experimental units with stipu- 
lated conditions with the result that the experiment has to be laid out 
at different plants. In either case, the research workers are interested 
in the combined analysis of the data from all of the experiments, besides 
their individual analyses. The purpose of the present paper is to sug- 
gest a simplified method to obtain adjusted treatment components for 
the combined analysis when the individual experiments are laid out 
in balanced incomplete block (BIB) designs. Gomes and Guimares 
[1958] have considered the case when the individual experiments are 
laid out in randomised complete block designs. The case dealt with 
by Gomes and Guimares could be deduced as a special case from general 
method suggested in this paper. 


NATURE OF THE PROBLEM 


Suppose there are g BIB experiments to be analysed jointly with 
the following parameters: 


v = number of treatments, 
= number of blocks, 

r = number of replications for each treatment, 

k = number of units per block, 

\ = number of times any two treatments occur together in the 

design. 

Let us suppose that, for each experiment, there are c treatments common. 
Hence, there are c + g(v — c) different treatments in all. Let us, 
following Gomes and Guimares, call the c treatments (common to all 
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experiments) ‘common’ treatments and the rest, g(v — c) treatments, 
‘regular’ treatments. Hence, the combined experiment will have for 
its parameters 


=c+gv—c), = gb, 
r’ = gr for the common treatments, 
r for the regular treatments, 
\’ = ga for two common treatments, 
\ for @ common and a regular treatment from the same 
experiment, 
0 for two regular treatments from two different experiments. 


As one can see, the design is no longer a balanced one, but still possesses 
the property of connectedness. 


METHODS OF OBTAINING ADJUSTED TREATMENT 
SUM OF SQUARES 


We proceed to find the adjusted treatment sum of squares for the 
combined analysis by two methods. The general method suggested 
by C. R. Rao [1947] treats the combined data as that of a new incom- 
plete’ Block experiment. Here we will only outline this method since 
it is well known in the statistical literature. The simplified method, 
which the author would like to introduce, makes use of the individual 
analyses already performed and this will be given later, along with 
the results for various special cases. 


Classical Method 


Here, the whole problem of combined analysis reduces to the solution 
of the adjusted normal equations 


Ct=Q (1) 


where the matrix C (v’ X v’) and the column vectors t (v’ X 1), Q (v’ X 1) 
have usual meanings. The elements of the C matrix are 


Ci = gd(v — 1)/k for common treatments, 


= —Ag/k for any two common treatments, 
= A(v — 1)/k for regular treatments, 
C4; = —A/k for any two regular treatments from the same 
experiment, 
4 = 0 for any two regular treatments from different 
experiments. 


Here the rank of the C matrix is v’ — 1 and hence we adopt a suitable 
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restriction, so that it becomes non-singular and easy to invert. We 
use the restriction 
g(v—c) 


i=c+l 


By using this restriction, the C matrix reduces to 
Erg I, 0 


-*5, I, x (a, 


where I is the identity matrix, 0 is a matrix having only zero elements, 
J is a matrix having only unity elements, X indicates the direct product 
and £ is the efficiency factor of the BIB design; the dimensions of the 
matrices are 


I, = ec Xe, 
I,=9X4q, J, = gv —c) Xe, (2) 
O=cX gv—c). 
This matrix has a pattern discussed by Roy in one of his papers (3) 
and hence the inverse can be easily found. The inverse turns out to be 


1 
Erg 


1 1 1 
I; X 3) 


where I, , I, , I; , J, , Jz , O are of the same dimensions as in (2). 
Hence, the solution of (1) turns out to be 


t=C'Q. (3) 


Finally, the adjusted treatment sum of squares is t’Q where t’ (1 X v’), 
a row vector. Total and Blocks (unadj) sums of squares are found 
in usual manner. 


Simplified method: 


This method is useful to those research workers who are not con- 
versant with matrix algebra and advanced mathematical topics. The 
method suggested here is based on individual analyses of g BIB ex- 
periments. 

Suppose we have performed individual analyses of g BIB experi- 
ments with the same parameter values, viz. v, b, r, k, \ and with c 
common treatments. 
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Q;.. = adjusted treatment total of the ith common treatment in 
sth experiment, 

Q. = >%., Q:.. = adjusted treatment total for ith common 
treatment in combined analysis, i = 1, --- , c¢, 

Q\” = adjusted treatment total for jth regular treatment in sth 
experiment, j = 1,---, (v—c),s =1,---,9. 


From these values, we obtain adjusted treatment sum of squares as 
follows. 
Form the following two-way table, Table I. 


TABLE I 


Two-Way TaBLE oF ComMON TREATMENTS AND EXPERIMENTS 
wiTH Q;,, VALUES 


c. treat 


Obtain (common treatments X experiments) interaction in the usual 
way. Then, 
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the adjusted treatment sum of squares for the combined analysis is 
Adj. Treat. S.S. (Combined Analysis) = Sum of Adj. Treat. S.S. 
from each exp. 


—1/Er (Common Treat. X Exp. Interaction) (4) 


ESTIMATION OF TREATMENT ESTIMATES AND VARIANCES 
OF TREATMENT DIFFERENCES FOR VARIOUS CASES 


Using the notation at the beginning of this section, we have 
Q;/gEr, (5) 


Further note that, if /‘° is the estimate of the jth regular treatment 
only using information from the sth experiment and if ¢{,, has a similar 
meaning for the 7th common treatment, then 


g s=1 
or 
= — + (Sa) 
t=1 


Thus to find the best pone one need only estimate the treatments 
for each experiment seperately and use (7) and (8) or (8a) to obtain 
the final estimates. Formulae (8) and (8a) could be used with advantage 
when (v — c) < cand (v — c) > c, respectively. 

At this stage, it might be noted that the adjusted treatment sums 
of squares may also be obtained by the following alternative formula: 


Adjusted treatment S.S. -> iQ; + (9) 


s=l1 


The variances of the various treatment differences are obtained as 
follows: 


(1) For treatment estimates ¢; , /;, belonging to the common group 
J 1 > Var (i; )= 2s°/gEr. 


(2) For treatment estimates f; and f{"? where é; is from common group 
and é; is from regular group. 


= Var (é, = {2 
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(3) For treatment estimates f{*’ and é{? where both of them belong 
to the regular group of the same experiment 
Vs = var — = 20° rE. 


(4) For treatment estimates ¢{"’ and ¢;* where both of them belong 
to the regular group but are from different experiments 


Vu = ver (8 1+ 


s’ in all of the above formulae stands for the estimated error variance. 


SPECIAL CASES 


(a) There may be several BIB experiments with only one treatment 
common which might be a control. In this case c = 1. 


Let ¢, denote the control. Then we have the following results. 


I. Adjusted treatment S. S. 


Sum of the adjusted treatment sums of squares from individual 
analyses 


II. Treatment estimates. 


9 


om | 9g based on (8a). 


III. The variances of treatment differences. 
(1) This case does not arise 
(2) — = 
(3) — = 
(4) Wij? — = 48°/rE. 
The numbers here correspond to those in the preceding section. 


(b) There may be several BIB experiments with the same treatments. 
In this case c = v. In other words, we want to combine several 
BIB experiments with identical parameter values and treatments, 
conducted at different places. In this case, we have results as follow: 
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I. Adjusted treatments S.S. 


II. Treatment estimates. 


c=). 


sim 


III. The variances of treatment differences. 
(1) var — = iF, 
(2), (3), (4) These cases do not arise. 


(c) The most important special case is obtained when 6 = r = XA 
and k = v. In this case BIB designs reduce to well known ran- 
domised complete block designs with c common treatments. The 
simplified analysis will remain the same with corresponding sub- 

stitution. 


(d) Cases (a) and (b) can be considered for (c). 
Since (c) and (d) have been considered in detail by Gomes et al 
[1958], they will not be repeated here. However, we present the 
adjusted treatment sums of squares for the three situations since 
they are somewhat simpler than those given before. 


Case (i), c = 1. 


Adjusted treatment 8. S. = Sum of treatment S. S. from the 
individual experiments. 


Case (ii), 1 <<c < 


. Sum of treatment S. S. from 
Adjusted trestment 5. 5. = { the individual experiments \ - 


{Interaction (common treatment X 
experiments) using treatment totals. } 


Case (iii), c = v. 


where 7’; is the total for the ith treatment over all experiments 
and G is the grand total for all experiments. 
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EXAMPLE 
(Hypothetical problem purely to illustrate the procedure) 


Two BIB experiments with the following parameter values are 
conducted and the combined analysis is required. The parameter 
values are 
v=7, b=7, r=4, k=4, A=2, g=2 and c=3. 
The following data are at our disposal. 


TABLE 2 
ANALYSIS OF VARIANCE FOR THE INDIVIDUAL EXPERIMENTS 


Experiment 1 Experiment 2 

Source d.f. S.S. d.f. 8.8. 
Blocks (unadjusted ) 6 342.71 6 63.21 
Treatments (adjusted ) 6 171.14 6 154.50 
Residual 15 165.11 15 469 .00 

= 7.50, Qf = -9.75, = —5.75, Qf = 0.50, 
Q3.. = —6.75, QS? = —8.75, Qs. = —9.75, Qi? = —8.25, 
= 0.75. = —5.00. 

Form, with Q;,, values, the following two-way table. 

t te ts Total 

E, = 18.00 7.50 —6.75 18.75 

13.75 —5.75 —9.75 —1.75 

Total 31.75 1.75 —16.50 17.00 


The (common treatment X experiments) interaction is found to 
be 31.2708. 
Hence 


Treatment (adjusted) S. 
for the combined analysis 
= 171.14 + 154.50 — 4/2 X 7 (31.2708) 
= 316.7055 
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The combined analysis is given below. 


TABLE 3 
ComBINED ANALYSIS OF VARIANCE 


Source of variation d.f.e S.S. MSS. 

Blocks (unadjusted) 13 425.375 32.7212 
Treatments (adjusted) 10 316.708 31.6708 
Residual 32 643 .042 20.0951 


With the use of the equations (7) and (8) we obtain the following 
treatment estimates for the combined analysis: 


i, = 4.5357, i, = 0.2500, i, = —2.3571, 
i? = —1.2619, = -—3.7619, = —3.4762, 
= -—0.7619, = 2.2619, = 3.9762, 
i? = —1.3810, i? = —0.4524. 

Variances of treatment differences are 
V, = 2s°/Erg = 5.741, 
9.569, 
V; = 2s°/Er = 22.966, 
+ = 15.310. 
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GENERALIZED ASYMPTOTIC REGRESSION AND 
NON-LINEAR PATH ANALYSIS* 
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Department of Experimental Statistics, 
North Carolina State College, Raleigh, North Carolina, U.S.A. 


1. INTRODUCTION 


It is a fundamental fact of statistical inference that the information 
contained in an analysis of experimental data is the sum of the a priori 
information built into the statistical model employed and the a posteriori 
information contained in the data itself. For this reason the bio- 
metrician must be concerned not only with the efficiency of his estimation 
procedures but also with the adequacy of his descriptive model. Much 
care must be taken to ensure that all relevant a priori information is 
utilized in the construction of the model. In some cases it is possible 
to derive rather sophisticated theoretical models on the basis of acquired 
knowledge and intelligent hypothesizing. These models are often con- 
veniently found as solutions of differential equations. In other cases 
little more may be known than that the biological process in question 
is continuous. In this latter case one may resort to polynomial models 
where the degree of the polynomial is either found empirically or by 
prior consideration of the number of ‘‘bends’’ which one can reasonably 
assume to take place in the process being studied. In certain other 
cases it may be known that the process approaches some asymptotic 
value. This is especially true in those cases known as “growth” pro- 
cesses. The general ineptness of polynomial models for purposes of 
describing such asymptotic situations has been repeatedly pointed out, 
although polynomials in the reciprocals may sometimes be used con- 
veniently. 

Stevens [1951] and Pimentel-Gomes [1953], writing in this journal, 
have discussed inferential methods related to one form of transcendental 
asymptotic model, the so-called exponential model, 
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y=at fe" +e, (1) 


where x and y are observables; a, 8 and y are unknown parameters to 
be estimated; and ¢ is a random error. The above mentioned writers 
give convenient computational procedures for finding maximum likeli- 
hood estimates when ¢ has a normal distribution with constant variance 
and the errors are uncorrelated one with another. Even when the 
errors are non-normal these same methods provide least square esti- 
mates of the parameters. The numerical devices used for finding the 
estimates are based on methods devised by Gauss and described in 
detail in the books by Whitaker and Robinson [1944] and Deming 
[1943]. The method of estimation, maximum likelihood, is due originally 
to Gauss and is discussed extensively by Fisher [1956] who is mainly 
responsible for our knowledge of the properties of the method. Both 
the numerical devices and the estimation method are reviewed in the 
recent work by Williams [1959]. 

Asymptotic models fall conveniently into either the class of trans- 
cendental models, including the exponential, or the class of rational 
models, including models such as the rectangular hyperbola and the 
inverse square law. 

This paper considers the general class of rational models and a 
special sub-class of transcendental models that may be regarded as 
a generalization of the exponential model. Appropriate techniques for 
estimation and for determination of asymptotic confidence limits, 
similar to those given by Stevens for the exponential, are provided. 
An alternative “parabolic method” of constructing confidence limits, 
valid for small samples, is also given. 

In many practical biological situations a number of variables are 
related one with another in complex causal networks. Wright [1918, 
1921] devised methods, which he termed path analysis, for analysizing 
such networks when the relationships between the variables are linear. 
Turner and Stevens [1959] have reviewed these methods for the im- 
portant case when some of the primary variables may be “‘fixed”’ or 
without probability distributions. The present paper considers some 
aspects of such situations when the relationships between variables are 
of the transcendental variety rather than linear. 

All of the models considered in this paper are non-linear in some of 
the unknown parameters. This implies that sufficient estimates and 
hence fiducial limits are not available. 

In addition the maximum likelihood estimates, though consistent 
and efficient, will be generally biassed, will not have normal distributions 
for small samples and will generally not have explicit forms, thus 
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requiring iterative solution methods. This is the price one has to pay 
for more adequate symbolic description of the process being studied. 
The gain in meaningfullness is generally well worth even this heavy toll. 


2. RATIONAL REGRESSION 


Let us denote a particular p-th degree polynomial by P,(x) and a 
particular g-th degree polynomial by Q,(z) and then let us consider 
the regression model arising from the rational function, 


y = [P,(z)/Q,(2)] + «, (2) 


where again z and y are observables and e is a random error assumed 
to have a normal distribution with zero expectation and constant 
variance o°. The errors are also assumed to be uncorrelated one with 
another and with the values of z. At least some of the coefficients in 
the polynomials are unknown parameters to be estimated. 

The numerical device for estimation due to Gauss, which was alluded 
to in the previous section, involves expansion of y — «¢ about “trial 
values” of the unknown parameters in a Taylor’s series. The method 
is sketched in Section 4 and is there applied to the models discussed 
in Section 3. However, a simpler approach is available for the rational 
model (2). 

Suppose that Q,(z) = 1 + 6,2 + --- + Bc", so that (2) may be 
written: y = [P,(xz)/(1 + Bix + --- + B,2°)] + « If we multiply 
through by Q,(z) and rearrange we get 


y = P,(x) — Bizy — — + Qe. (3) 


Now (3) is an ordinary multiple linear regression model except for the 
factor Q in the error term. Thus weighted least squares could be used 
for estimating the unknown parameters if the weights Q~? were known. 
A simple plan is to take trial values for the parameters 6, , --- , 8, 
and compute provisional weights. Then (8) is fitted by least squares 
giving better estimates of the @’s as well as provisional estimates for 
, ,°** , @ occurring in P,(z). Using the improved estimates 
for the #’s, better weights are determined and the fitting is repeated. 
The process is thus iterated through several cycles until stable values 
of the estimates are obtained. In many cases it is sufficient to use 
constant weights (Q~* = 1) in the first cycle of iteration, thus not 
requiring trial values. 

Asymptotic variances and confidence limits are found in the usual 
way from the final least square fit. Thus if for n observations we have 
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1 xi eee eee 


B’ = (a a, 
Y’ = (Yr Y2 Ya)s 
w =| 0 --- 0 
L@ @ 


n 


’ 


then the final estimates after iterating are 
B = (X’/WX) 'X’WY (4) 
and the asymptotic variances and covariances are given by 
Cov (B) = (X’WX)"'s’, (5) 
where s’, the estimate of o’, is provided by 
= (Y'WY — B’X’WY)/(n — p —q 1). (6) 

If the error in (2) depends upon z in a known way, only a slight 
modification of the procedure is required. Suppose that ¢, = f(z)-e. 
Then the error term in (3) will be fQe and the weight matrix W will 
be diagonal with elements (f,Q,;)~? instead of (Q,;)~*. Formulas (4), 
(5), and (6) will remain the same. 

Computational details for those not familiar with linear regression 
methods are given by Williams [1959]. Let the elements of (X’WX)™* 
be denoted by c,;; ; then the individual asymptotic confidence limits 
fay each coefficient are 

— te Sa; + te t= 0,1,---,p, 

— te < BS + te t= 1,2, 
where ¢, is a Student’s ¢ at an assigned confidence level (1 — 7) with 
n — p — q — 1 degrees of freedom. 

As an example consider the case of p = 0 and q = 1, the case of 
the rectangular hyperbola. We have 

y = [ao/(1 + 6,2)] + € (8) 


with asymptotes x = —1/8, and y = 0. Corresponding to (3) we have 
the regression equation y = a — f,rzy + Qe, where Q = 1 + 6,2. 


(7) 
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A graphical estimate of the slope — 8, can be obtained by plotting y 
against xy and provisional weights computed from w = (1 + £,z)~”. 
Then using these provisional weights we compute 


and obtain better estimates of the weights. After stability is reached 
we find 


& = (Lowy t+ wry)/X w (10) 
and the estimate of the variance o’, 


We also obtain estimates of the asymptotic variances for-a, and }, : 


(2) way)” 
Gow 
8, = 


from which asymptotic confidence limits may be calculated. 


| 


(12) 


3. TRANSCENDENTAL REGRESSION: 
THE SINGLE PROCESS LAW 


The rational models discussed in the previous section may often 
be used to conveniently describe an asymptotic process. However, 
when more than just a few terms in the polynomials are required for 
adequate description of the: process, the rational models may suffer 
from three deficiencies: (1) physical or biological meaning of the coeffi- 
cients may be lost, (2) annoying “bumps” tend to appear in the fitted 
curve which have no reality in the actual process, and (3) excessive 
numerical computation may be required due to the number of para- 
meters which need to be estimated. 

Certain transcendental models may provide a more parsimonious 
description of an asymptotic process and yet preserve considerable 
flexibility of form. By “parsimony” is meant the ability to describe 
adequately with a small number of parameters. This may be accom- 
plished by supplying the a priori information that the curve is regular 
in some way or another. For example, we may specify that the curve 
is monotonically increasing or decreasing throughout its course or that 
certain of the derivatives have this monotonicity property. We may 
then avoid the undesired “bumps” and make possible simpler com- 
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putations by using this information. In addition, since solutions of 
most differential equations are in terms of transcendental functions, 
we may wish to seek a model which is related to a class of differential 
equations, thus making interpretation of the parameters more meaning- 
ful. 

One simple class of transcendental regression models has been called 
by Turner [1959] the single process law. Consider the differential 
equation 


dn/dt = — a)/(E — 8). (13) 
Integrating we get 
n=a+ 8)’, a4) 
or u= ys. 


Now let us suppose that the observables x and y are defined by the 
error equations x = £, and y = 7 + e, where « is a random error defined 
as before. Thus the model corresponding to the single process law (14) is 


y=at Bx — 7 5) ‘15) 


for convenience we will always take x — yi > O. 

The proportionality constant 6 in (13) determines the nature of 
the process. When 4 is a positive integer, we have a polynomial process: 
however, the model (15) is non-linear when this integer is greater than 
two. When 6 is a negative integer, then we have a rational process, 
a special case of the processes studied in the previous section. When 
6 is non-integral, the process is transcendental. In particular, when 6 
approaches infinity, either positively or negatively, the model (15) 
approaches the exponential model (1), y = a + Be” + «. 

An interesting series of models, most of which are commonly used 
in curve fitting, is obtained when 2/5 = —4, —3, —2, —1, 0, +1, +2. 
These cases correspond to the inverse square root law, the inverse two- 
thirds law, the rectangular hyperbola, the inverse square law, the ez- 
ponential, the parabola, and the stratght line, respectively. Thus all 
of these are special cases of a continuously varying family of curves 
(See Figure 1). We observe that when 2/6 is negative the curves have 
two asymptotes, that when 2/5 = 0 there is but one asymptote, and 


that when 2/6 is positive there is no asymptote. Thus the exponential - 


may be thought of as a “‘transition’”’ curve between those curves possess- 
ing two asymptotes and those possessing none. Without loss of gener- 
ality we may consider only the case of decreasing curves, commonly 
referred to as “extinction curves’. Several useful constants for the 
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INVERSE SQUARE - ROOT 


HYPE 
INVERSE“SQUARE_LAW_ 


EXPONENTIAL 


FIGURE 1 
SpeciaL Extinction CurvES 


special cases enumerated above are summarized in Table 1. In this 
table the minimum level is taken in all cases to be a and the initial 
level is measured from this minimum value. 

Table 1 includes the three quarters life and the half-life. The ratio 
of the two provides a convenient means of estimating 6 graphically. 
From equation (14) we determine the initial value of 7 to be m = 
a + B(—yé8)’ and the minimum value of 7 to be main = a. The value 
of & corresponding to (n) — a)/2 is the half-life and is given by 5. = 
— 27"). Similarly we obtain the three-quarters life 
The ratio is then A = £75/tso = (1 — 4-”*)/(1 — 27”), or, inverting, 
we have 


5°’ = —log, (A — 1). (16) 


The parameter A is given for each of the special cases in Table 1. and 
the relationship (16) is charted for all cases in Figure 2. 

A more general criterion based upon any two percentile-lives is 
readily found to be 


Mp. Pa) = = (1 — — (17) 


where p, + gq: = Ps + 92 = 1. 
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inverse 
cube-root 


2 3 4 8$ 678690 
15 707i... 24142... 3.8284... 6.6568... 


XACRITERION 
FIGURE 2 
RELATIONSHIP BETWEEN THE \ CRITERON AND THE EXPONENTIAL PARAMETER 6 


4. THE GAUSSIAN ITERANT 


Gauss appears to have been the first to use the Newton-Raphson 
technique for purposes of finding least squares estimates of non-linear 
parameters. See Whitaker and Robinson [1944]. In the case of regres- 
sion with normally distributed errors, Gauss’ technique is equivalent 
to the important method of scoring due to Fisher. See Fisher’s recent 
discussion [1956] of the subject. 

The technique may be conveniently described in terms of the total 
derivative. Let us suppose that we have the general regression model 


y = +e 


where § is a vector of unknown parameters. The fitted “prediction” 
equation is, in terms of estimates b, 


= f(b, 2). (18) 

By the rule for the total derivative we get dj = 9, db, + 9.db.+-:--, 
where 7; = 09/db; . Then to a first approximation 
Ag = ro Ab; + Gro Ab, + rae 


where Ag = y — yo, Ab, = b, — bio , Abe = b, — bao, --: . The zero 
subscript indicates that trial values for the undetermined estimates 
have been inserted. Thus, we get 9 = Go + Abigio + AbsGoo + °°: , 
and finally 


Y = Yo + + + --- +e. (19) 


The symbol e represents the estimate of the error «. Now (19) is an 
ordinary multiple linear regression equation and estimates may be 


A 
4 128 
: 
| | | | NIT 
4 
= 


REGRESSION AND PATH ANALYSIS 129 


found as before. From the linear regression estimates may be found 
improved values for the original non-linear regression estimates. Thus, 
the process is iterated finding successive improvements. It may be 
seen from (19) that trial values for estimates which are linear in (18) 
are not needed to prime the iterative process. This point was noted 
by Stevens [1951] and will be illustrated in the next section. Refer to 
Deming [1943] for a fuller discussion of this technique of estimation. 


5. MAXIMUM LIKELIHOOD ESTIMATION AND ASYMPTOTIC 
CONFIDENCE LIMITS FOR THE SINGLE PROCESS LAW 


Applying the method of the previous section to the single process 
law we obtain the linearized regression equation: 


y = a+ — m)* — bdo(z — — my) 
+ b(z — m)* In (x — m)(d — d) +8, 


where dy , bo , mo , and dy are trial values and m = cd. Note that (20) 

is linear in a and b and the corrections (m — m,) and (d — d,). Thus 

provisional estimates of a and b and improved estimates of m and d 

can be found by fitting (20) by least squares. Iteration provides the 

maximum likelihood estimates. We notice that trial values of the linear 

coefficients a and b have been eliminated and are thus not required. 
For convenience we write 


(20) 


B, = a, Z = 1, 


B, b, Zi = (x (21) 
B, = —bd,(m — Mo) ; = — m)*", 
B, = bd — Zs = — m)* In (x — m). 


Then (20) may be written 


y = Bo + B,Z, + B.Z, + +e. 


We have then to find estimates of B, , B, , B, , and B; . 
found by multiple regression methods as follows. Let 
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The normal equations are Z’ZB = Z’Y and, if Z’Z is non-singular, 
the vector of improved estimates is given by 


B = (Z'Z)"'2’Y. (22) 


Whenever the error depends upon z in a known way, (22) must be 
replaced by B = (Z’WZ)™'Z’WY where W is a diagonal matrix of 
weights as in Section 2. From the results of (22) we obtain improved 
values of m and d by using the relations (21). Thus 


m = m — B,/dB, , 
d=d,+8,/B, . 


These improved values replace m, and d, in (21) and the solution (22) 
is found again, giving still better values. The process is iterated until 
constant values of m and d are found to the desired degree of precision. 
After stable values are obtained we have 


a= B,, b=B,,. c= m/d, (24) 
and the estimated curve is given by 
=a+ — (25) 


If » = 76 is known a priori, then the third row and column of Z’Z, the 
third element of B and the third element of Z’Y are deleted. Other- 
wise, the iterative process is the same as before. Application of this 
case to a biomathematical model of running records has been made 
by Turner [1959]. Similarly, when 6 is known a priori, the fourth row 
and column of Z’Z and the third elements of B and Z’Y are deleted. 
If, as is often the case in extinction curves, the minimum value a is 
known a priori then y is replaced by y’ = y — a and the first row and 
column of Z’Z and the first elements of B and Z’Y are deleted. Thus, 
we see that one estimation procedure suffices for the general problem 
and several special cases as well. However, in the case that 6 is a 
known integer, we may prefer to use the methods of Section 2. If 6 
is known to be infinite, the methods of Stevens and Pimentel-Gomes 
referred to earlier should be used. 

By examination of the definition of the Z’s, equations (21), we 
see that Z’Z is singular if 6 = 0, 1, or +. Therefore, the method 
of this section may be expected to break down in the neighborhood of 
these points. Trial and error minimization of the residual sum of 
squares, S = >> [y — a — b(x — m)*]’, may then be resorted to, perhaps 
utilizing some interpolation method such as that proposed by Will [1936]. 
Values of m and d may be selected and S minimized by ordinary straight 
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line regression techniques. Thus for the selected values m, and d, 
we may regress y on X = (x — m,)", obtaining the minimum S for 
these values: = (y — 9)” — X(y — (X X)’}. 
Similarly, values S,. for m, and dz , S22 for m, and d, , et cetera, may 
be found and the minimum located. 

Alternatively, we may choose selected values of 6 only (or of u 
only) and use the deleted normal equations to minimize with respect 
to a, B, and u (or a, 8, and 5). Then simple one-dimensional interpolation 
will locate the minimum value of S. 

When Z’Z is only moderately ill-conditioned, we may improve con- 
vergence by damping oscillations. Wegstein [1958] recommends using 
in the z + 1-th iteration (in place of m,; and d,;) the modified values 


mt = um;-, + vm; , 
ad? = ud;-; +vd; 


The following data taken from a sampling experiment [Turner, 1959] 
will illustrate the iterative process. 


(26) 


162.47 
108.10 
81.12 
66.11 
55.06 
49 .37 
39.72 
33.81 
30.90 
30.73 


1 
2 
3 
4 


SCO 


Trial values m, = —1.00 and d, = —1.00 were tried and the results 
below were obtained for successive iterations. 


Iteration m 


f 
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0 —1.00 —1.00 
| 
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Asymptotic confidence limits are easily found. After the final 
iteration the asymptotically unbiassed estimate of o” is given by 


— B, — Bs Zsy)/(n — 4). (27) 


Let us denote by c,; the element in the ith row and jth column of 
(Z'Z)"*. It has been well known since Gauss that the variance o7 
of any function of the B’s (f say) is asymptotically given by 


Dodds, 


where f; = of/dB, and f; = af/8B; . We may use as the estimated 
variance of the function the quantity 


j=0 


Then from (23) and (24) using (28) we find the estimated asymptotic 
variances for a, b, m, d, and c to be 
82 = Coos, = 
Finally, for y, using (25) and (28), we find the asymptotic variance 
= 3" > (30) 


From (29) and (30) we can construct asymptotic confidence limits. 
We get 
i,s,, 
: 
m— 
c— Sct , 


with (1 — 2) confidence each where ¢, is Student’s ¢ for the w level 
of significance with n — 4 degrees of freedom. The last confidence 
statement provides confidence limits for the true value of 7 = E(y) 


(31) 
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fora given xz. See Deming [1943] for detailed discussion of these methods 
for any model. It must be emphasized that (31) provides individual 
confidence statements at the assigned (1 — 7) level of confidence. 
Of course, the joint confidence level is much less than (1 — 7). 

These confidence statements provide a ready guide to the tenability 
of various hypotheses about the unknown parameters in: the single 
process law. If it is specifically hypothesized on theoretical grounds 
that some particular parameter takes a given value (e.g., Hy : 6 = 4p), 
then we may test the concordance of the data with this hypothesis 
by seeing whether or not the hypothesized value lies within the con- 
fidence interval. If the hypothesized value is in fact tenable, we may 
wish to refit the regression equation after making the proper deletions 
for a known a priori parameter. On the other hand, if there is no 
theoretical reason for making such a hypothesis, it does not seem wise 
to recompute the curve for some convenient value of the parameter 
lying within the confidence interval. Such practice will lead to a false 
sense of confidence in that narrower confidence limits will result from 
fixing the parameter. 


6. PARABOLIC CONFIDENCE LIMITS FOR SMALL SAMPLES 


The asymptotic confidence limits described in the foregoing section 
may be expected to give adequate results for large samples or even 
for quite modest samples if the error rate is small. For example, in 
a recent sampling investigation [Turner, 1959], it was found that, 
with a sample size of ten equally spaced points covering the major 
portion of the curve and with a coefficient of variation of two percent 
at the center of the curve, the sampling variation of d, the estimate of 4, 
was quite accurately described by the asymptotic distribution. _How- 
ever, when the error rate is much larger, it may be supposed that the 
asymptotic distribution will not suffice for construction of tests or 
confidence limits. 

We will now describe a method of constructing an exact test and 
corresponding confidence limits based upon a class of “parabolic’’ 
alternative hypotheses. 

We consider the null hypothesis 


Ho: — po = 6 — 6 = O. (32) 
Thus, when the null hypothesis H, is true, we have the mode! 
= at Bt, — +e, (33) 


where 7 = 1, 2, --+ , n for sample size n. 
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Let Zio = (x; — uo)’*; then the model (33) for Ho true is written 
VE =at bBZotea, 


the model for a straight line. 
Now, if H, is not true, we could write 
f(Z 0) +<¢. (34) 


Expanding (34) in a Maclaurin series and preserving terms through 
the quadratic, we get 


= Bot BiZin + B.Zio + & - 


If we suppose that coefficients of terms beyond the quadratic are 
negligible, then the null hypothesis, 


B, = 0, 


is equivalent to (32). An exact test of Hé is well known and is easily 
performed. 
Let 


Su Zio (> Zio) /n), 
Si = Zio — (> Zio) > Zio)/n), 
Sy = Ziyi — (o> y:)/n), (35) 


= » Zio (> Zio)’ /n), 
Sy = Zioys — (xX Zio y:)/n), 


Sw = (0 

where all summations are overt = 1,2, ---,n. Then we have estimates 
B, (S22S,, $1282,)/(Si S22 = Siz), 
B, = (81:82, — — Sis). 


Then the residual sum of squares about the fitted quadratic curve is 
given by 


(36) 


SSE = Sy = (37) 
Similarly, the residual sum of squares about the straight line is given by 
SSE’ (Si,/ S11). (38) 


Then an exact F-test for Hi : B, = 0, which is approximately equivalent 
to Hy: 4 — po = 5 — & = O, is performed by calculating 


F = (n — 3)(SSE’ — SSE)/SSE. (39) 


‘ 
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Then if F > F, , the tabular value of F with 1 and n — 3 degrees of 
freedom, the null hypothesis is rejected at the zm level of significance. 

A joint confidence region, with (1 — 7) confidence, for » and 6 
(and hence y and 6) may be found by determining those values of pu» 
and 6) which ensure that F < F,. This may be done most,conveniently, 
perhaps, by calculating F for various selected values of uo and 6) and 
then interpolating to find the confidence contours. A numerical example 
when u is known a priori has been given by Turner [1959]. 

In some cases it may be necessary to retain higher-power terms 
in the Maclaurin expansion. Since then a wider class of alternative 
hypotheses would be admitted, the power of the test against any spe- 
cific hypothesis would in general be lower. 

When observations are replicated for various values of x, the numer- 
ator of (39) may be tested against the within replicate sum of squares 
to give a more powerful test. If the within replicate sum of squares 
is denoted SSE*, we have the test 


F = n(r — 1)(SSE’ — SSE)/SSE*, (40) 


with 1 and n(r — 1) degrees of freedom, where n is the number of points 
on the curve as before and r is the number of replicates. A confidence 
region may be found corresponding to this test just as before. 

A test similar in concept to the parabolic test has been proposed by 
Williams [1959] and is discussed in connection with the single process 
law by Turner [1959]. Williams’ test is not likely to be as powerful 
as the parabolic test whenever there is more than a single non-linear 
parameter, as in the present case. If either wu or 6 is known a priori 
then Williams’ test should be considered as an alternative to the para- 
bolic test. 

Hotelling [1939] has proposed a method of constructing exact tests 
which could be applied to the construction of a test of the hypothesis 
8 = Oin the single process law. However £ is the constant of integration 
and is generally of little interest since the character of the process is 
not determined by this constant. 


7. NON-LINEAR PATH REGRESSION 
AND THE SINGLE PROCESS LAW 


Let us suppose that we have p “secondary” variables which are 
causally related to one another and to a set of q ‘‘primary”’ or “control- 
lable” variables. We imagine that the q primary variables (denoted 
t. , &,°-++ , &) have values which can be chosen at will by the experi- 
menter. In particular, it is possible that they are randomly chosen 
but this is generally not so in laboratory experimentation. The p 
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secondary variables (denoted 7, , 72, °** , 7,) are then supposed to 
be completely determined. The rates of change of one variable with 
respect to another are determined by a matrix of partial derivatives 


Mie Wee °°° Nee | 


Mo 


Mp Nap *** 


When 7; = a; , @ constant, then the relations between all of the 
variables are linear. A recent review of linear path regression analysis 
is that of Turner and Stevens [1959]. See also Turner [1959]. The 
method of path regression analysis, originally described and named 
by Wright [1921], has been applied only in the case of linear relations, 
with one exception [Tukey, 1954]. The method, however, only attains 
its full power when the relations between variables are described by 
non-linear equations. This is partly what Tukey meant when in his 
critique [Tukey, 1954] he said that classical path coefficients are ‘‘ very 
good, but they don’t go far enough’. For example, the important 
field of chemical reaction kinetics requires a path analysis wherein the 
relations between variables are governed by the law of mass action. 

In many cases we may regard ,; to be a function of »,; and £; (or »;) 
only. In particular, we may consider that the relations between varia- 
bles follow the single process law. 

Let a matrix of rate coefficients be 
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and diagonal matrices be 
re O 0 | OF 


Now we will consider the multivariate analogue of the single process 
law given by 


H’ = = D,) Ai(D, (42) 
(D, D.)" AX(D, D.) 
The analogy in form to the univariate differential equation (13) is 
evident. A solution of (42) is 


n= a + = be) ** (Es 

“(Ea — — +++ (mp — 

Ge — — (ty — 
Np = + — — me)’ 

“(Eq — — — (1). 


Thus (48) are the “structural equations” for the causal network arising 
from allowing each pathway to individually follow the single process law. 

As with linear path regression analysis many of the pathways may 
be non-existent for any particular problem. These restrictions on the 
structural equations are made simply by setting the corresponding 6 
to zero. 

Note that the structural equations (43) relate “true” variables and 
parameters. No stochastic variables have been considered yet, and thus, 
we may transform the equations arbitrarily. We take logs as follows: 


nt log (m a), log Ba) at = log B, 


nt = log(n. —a2),. & = log (& — uw), at = log , 
log (n> = Qy); log Me); ar log 
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and a* = 6;;. Then (43) becomes 

nt = af + + atti + + + + --- + 

at t+ + + + + --- + adn,* (44) 

= ap + + + + aft + akin? + + --- 
These transformed structural equations are identical in form with 

those underlying linear path regression analysis. For a more detailed 


comparison see Turner [1959]. We may write (44) in matrix notation 
as follows: 


(A, =n (45) 
where 
lat at, at}, 0 at ip 
(A, A,) az A2¢q 0 
os a, --- aS, | 
and 


= (1 & --- &%), 
n’ = (nf nF 03). 
From (45) we get by multiplication 


Amn=n (46) 
and 
n — = 
(I — A,)n = ALE. 


If (I — A,) is non-singular, we get the reduced transformed structural 
equations. 


= (I — A) “AE. (47) 


Thus, the transformed secondary variables n are expressed as linear 
functions of the transformed primary variables — and it is seen that 
the necessary and sufficient condition for this to be possible is that 
(I — A,) be non-singular. 

Inverse log transformation will then provide reduced structural 
equations in terms of the original variables also such that each secondary 
variable is a function of just the primary variables. 
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There are several types of causal relationships which are con- 
veniently distinquished. First of all there is the situation in which 
a single secondary variable is determined by one or more primary 
variables. This is the case of “ordinary regression’’, the single structural 
equation being the non-linear analogue of that corresponding to the 
usual multiple regression model. A second case is the situation in which 
two or more secondary variables are jointly determined by some or 
all of a common set of primary variables. We term this case the case 
of “‘joint” regression. In a third situation there is a chain of cause and 
effect leading from one or more primary variables to a secondary variable 
and then on to still another “secondary” variable. This case we term 
the case of “chain” regression. A final type involves cycles of causation 
or ‘‘feedback” from one secondary variable to another and back again 
after possibly passing through one or more other secondary variables. 
This is the case of ‘cyclic’ regression. All of these cases individually 
or in combination have been synoptically treated in what has gone 
before. Here we wish only to note that the several cases are distinguished 
by having different kinds of (I — A,) matrices. For this reason we 
term (I — A,) the “classification” matrix. It may be verified that 
regression type and (I — A,) are related as follows: 


regression type (I — A,) 
Ordinary 1 (scalar) 
Joint I 

Chain Triangular 
Cyclic (feedback) Non-triangular 


The heavy algebra implied by (47) may be conveniently by-passed 
by the use of algorithms (see Turner and Stevens, [1959]) when the 
classification matrix (I — A,) is triangular, that is, when there is no 
feedback in the system. General algorithms are also available. 

We now consider three elementary systems to illustrate the fore- 
going and to provide concreteness. ; 

Consider first, the “multiple regression” analogue, with the path 
diagram below. 


we 


The structural equation is then found to be 
m = a + — — (48) 


if 
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A second example involves ordinary, joint, and chain regression. 


2, 
The structural equations are 
m = a, + — 
M2 = a2 + Balt, — — 
Reduction by use of the algorithms leads to 
m = ay + — — 
M2 = a + — 
A final example is one involving feedback. The path diagram is as below 


(49) 


50) 


We have the structural equations 


m = a, + — Me)’ 


(51) 
Reduction by (47) then gives 


~ 


Let us assume independent additive normal errors in the observable 
variable y and non-existent errors in the observable variable z so that we 
have z; = fort = 1,2, --- ,q, andy; = + fort = 1,2, ---, p. 
We further assume that the «, are uncorrelated from one variable to 
another. Then after reduction we will have p independent non-linear 
regression equations, each of the form 


9 = By + Biz. — — My)” — 


where D, , D, , ---., D, are functions of the §’s and B, is a function of 
the and &s. 
Now precisely the same problem of “identification” which plagues 
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linear path analysis arises again here. The problem of estimation is 
much simplified if we can confine our attention to finding the B’s and 
D’s. In certain cases it is then possible to solve for estimates of the 
original parameters in terms of just the B’s and D’s. When this is 
possible, we say the system is just-identified. See Turner and Stevens 
[1959] for discussion. In the present instance we will restrict con- 
sideration to the just-identified case. 

Again we make use of the Gaussian iterant. We first obtain trial 
estimates Mao, Dao, Doo, Deo. Then we compute: 


Zio = — = Mw)” — Max, 
Zoo = Zyo/(t. — Mao), 
Zs0 = — Mv), 


Za+1.0 = (54) 
| Z10 log. (x. Mw); 
Z.+3,0 = Zio log, (1 — Mio), 


= Zio log, — Mo), 
for all observational vectors. In addition we define: 


B, —Byo Mw), Base B,(D, = Daw); 
B, = —Bio Di(M, Mw); Bass = Dyo); (55) 


Using the device of the total derivative we obtain the approximate 
linearization 


= By + B,Z10 + 20 + mote + (56) 


with which to replace (53). By the usual methods of linear regression 
we now obtain the estimates B, , B, , --- , Bags; and from (55) we 
get the improved estimates M, , --- , D, . These now can be used 
to replace M.. , --- , Dao in (54) and the entire process reiterated until 
stable estimates are obtained. 

If any subset of parameters is known a priori then, as in the simple 
regression case of Section 3, the appropriate Z’s are simply omitted 
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and the analysis carried out without them. For example, if it is known 
that all processes have vertical asymptotes zero, then we omit Z,, 
through Z,41.0 - 

When there is more than a single response variable, the application 
of (56) separately to each provides multiple estimates for the vertical 
asymptotes M, , M,,--- , M,. In order to provide unique, efficient 
estimates of these values, we fit, instead of (56), the family of hyperplanes 


= Box + BZ 10; + B.Z20; + + 
+ + “+ (57) 


where 7 = 1, 2, --- , p and B, , B; , --- , B,,, are common for all 7. 
Linear regression theory again provides the minimum variance un- 
biassed estimators for fitting (57). 

Now the final estimates of the B’s, M’s and D’s will not be minimum 
variance unbiassed estimates but they will be maximum likelihood 
estimates and hence they will be consistent, efficient and invariant 
under transformation. Thus, since we have assumed a condition of 
“just-identification” we can solve for estimates of the 6’s and these 
estimates will also be maximum likelihood estimates. 

If we were to have an under-identified system, then the methods 
of this section would be as satisfactory as any. However, if we were 
to have an over-identified system, another procedure would be re- 
quired. Perhaps it would be best in that case to simply determine the 
maximum likelihood estimators directly from the special case being 
considered. A general computing method appears to be too cumber- 
some for all possibilities of over-identification. 

Asymptotic tests and confidence limits for the just-identified case 
are found as before by applying linear results to the final iteration. 
The parabolic test is also applicable to the general case of this section, 
but, of course, the numerical difficulties are much increased. These 
may not be prohibitive in the simplest cases, however. 
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ON THE ANALYSIS OF SPLIT-PLOT EXPERIMENTS 


H. Leon Harter 
Aeronautical Research Laboratory, 
Wright-Patterson Air Force Base, Ohio, U.S.A. 


Summary. 


A crucial question in the analysis of split-plot experiments is whether 
or not the interaction between subplot treatments and replications 
should be pooled with the three-factor interaction of main-plot treat- 
ments, subplot treatments, and replications, the result being called sub- 
plot error. A brief history of the disagreement over this question is 
given, along with a rule for deciding, on the basis of a preliminary test 
of significance, whether or not to pool. Several numerical examples 
are cited, and one of these is worked out in detail. 


1. History. 


The simplest type of split-plot experiment is one in which a levels 
of one treatment (the main plot treatment A) are arranged in a random- 
ized blocks design with r blocks (replications R), and b levels of a second 
treatment (the subplot treatment B) are assigned at random, one to 
each of b subplots in each of the ar whole plots. It is assumed that a 
correlation p exists between the experimental errors for any two sub- 
plots in the same whole plot, but that experimental errors for subplots 
in different whole plots are uncorrelated. Historically, most statisti- 
cians have assumed that interactions between treatments and replica- 
tions do not really exist. Hence they have computed a sum of squares 
for subplot error without showing a breakdown into sums of squares 
for B X Rand A X B X R, and have used the mean square for subplot 
error for testing the significance of both B and A X B. Some (see, 
for example, Federer [5, p.'274]) have even asserted that B X R and 
A X B X R are confounded with each other and hence are really not 
separable, though it is possible to compute the two interactions arith- 
metically. On the other hand, Anderson and Bancroft [1, p. 350] have 
opined that if the subplot treatments are considered to be random, it 
is necessary to separate B X Rand A X B X R. Among those who 
have considered the case in which the interactions between treatments 
and replications are not assumed to be zero are Wilk and Kempthorne 
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{10, pp. 101-102]. Chew [3, pp. 45-49] has assumed only that the 
three-factor interaction is zero. In either of these cases, B X R and 
A X B X R must be separated, since the expected values of their mean 
squares are not identical. We shall consider cases in which it is not 
known a priori whether or not the interactions between treatments 
and replications are zero. We shall assume that FR (replications) is 
always a random factor, and consider four cases: (1) A and B both 
random; (2) A fixed and B random; (3) A random and B fixed; and 
(4) A and B both fixed. Of these, (4) seems to occur most often in 
practice. We shall define the variance of an effect involving a fixed 
factor having f levels with (f — 1) instead of f in the denominator. 
The expected mean squares are summarized in Table 1, along with the 
test ratios for A, B, and A X B. 


TABLE 1 


Summary oF ExpEcreD VALUES OF MEAN SQUARES AND TEstT Ratios 
FoR Spuit-PLot ExPERIMENTS 


Expected Value of Mean Square 
Mean (all factors random)* 

Main-plot 

Treatments A a. br b {r] {l] 1+6b—I1p 
Replications R ab (b) fa] [(1)] 1p 
Sar b {1} 1+6—I1p 
Subplot 

treatments B Ss ar (r) a (1) l-—op 
AXB Sao r 1 
BXR Sor, a (1) 1l-p 
AXBXR Sar 1 


*For brevity, we show multipliers of components of variance in the table; thus, for example, 
E(Sebr) = otetr + (1 — p)o*. If A is fixed, omit components whose coefficients are enclosed in pa- 
rentheses. If B is fixed, omit components whose coefficients are enclosed in brackets. (It is assumed 
that R is always a random factor.) 


Test Ratios for A 
If B is fixed, use Se/Ser (exact test). If B is random, use Se/Ser (assumes gas? = 0) or Sa/Sad 


(assumes oer = 0, p = 0). Satterthwaite test: Sa/(Ser + Ses — Sadr); Cochran teat: (Sa + Sabr)/ 
(Sar + Sad). 


Test Ratios for B 
If A is fixed, use Si/Sor (exact test). If A is random, use Ss/Sor (assumes oes? = 0) or Sb/Sad (as- 
sumeso*tr = 0). Satterthwaite test: + Ses — Sadr); Cochran test: (Ss + Saar)/(Sor + Sod). 


Test Ratio for A X B 


Use Sao/Sabr (exact ) 
(One is usually not in testing significance of FR and its interactons.) 
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2. Decision Rule for Pooling. 


We find in Table 1 that the expected value of the mean square for 
the A X B X R interaction is o?,, + (1 — p)o’. The expected value 
of the mean square for the B X R interaction is ac?, + c*,, + (1 — p)o” 
if A is random, but it is ao}, + (1 — p)o’ if A is fixed. In either case, 
the expected values of the mean squares for B X RandforA X BXR 
are not identical; hence they should not be pooled unless this action is 
indicated by the results of a preliminary test of significance. This 
preliminary test is a test of the null hypothesis H, : E(S,.) = E(S.,,). 
Since F is assumed to be random, both S,,/E(S,,);and S..,/E(Sas,) 
have central x*/d. f. distributions. Hence under A, the test ratio 
S,,/S.s- has a central F distribution. The form of, this preliminary 
test depends upon whether A is random or fixed. If A is random, 
H, reduces to Hj, : o;, = 0, the expected value of the mean square for 
B X Ris greater than or equal to the expected value of the mean square 
for A X B X R, and the required preliminary test of significance is 
a one-sided test on the ratio of the mean square for B X R to the mean 
square for A X B X R. If this ratio does not exceed a certain per- 
centage point of the F distribution, the two mean squares are pooled; 
otherwise, they are not pooled. This problem has been studied by 
Bozivich, Bancroft and Hartley [2], who recommend a preliminary 
test of size a, where .25 < a < .50. On the other hand, if A is fixed, 
H, reduces to Hi’ : o2,, = aci, , nothing can be said about the relative 
magnitude of the expected mean squares for B X Rand A X BX R, 
and the procedure recommended by Bozivich, Bancroft and Hartley 
must be modified slightly; the preliminary test of significance is a two- 
sided test of size a, with probability a/2 in each tail of the F distribution. 
In this case, it is not possible to make a test of size a = .25 without 
computing a special table, since no table of the upper 12.5 percent points 
of the F distribution is available. Tests of size a = .20 and a = .50 
are possible, using tables of the upper 10 percent and 25 percent poisits 
of the F distribution tabulated by Merrington and Thompson{7]. 


3. Examples. 


Five numerical examples have been taken from the literature and the 
subplot error for each has been subdivided into B X RandA X BXR 
interactions. In each of these examples, it appears reasonable to assume 
that both A and B are fixed factors, so that the preliminary test of 
significance will be two-sided. Table 2 gives, for each example, the 
source of the data, the mean squares and the numbers of degrees of 
freedom for B X R and for A X B X R, the ratio of the larger mean 
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TABLE 2 
PRELIMINARY TESTS FOR POOLING IN SpLit-PLoT EXPERIMENTS 


BXR | AB XR Critical 
Interaction Interaction Larger M. S. Values 
Source Ratio = 

Code M.S. M.S. df. Smaller M.S. | Fo F.2 
(1) 0.0367 15 0.0236 30 1.56 1.72 1.32 
(2) 15.29 6 25.50 42 1.67 2.78 1.75 
(3) 28.08 9 13.18 27 2.13 1.87 1.37 
(4) 119.21 15 206.02 30 1.73 1.87 1.40 
(5) 19.36 70 21.03 140 1.09 1.32 1.16 


Sourcés: (1) Snedecor [9, Table 11.35, p. 310], (2) Federer [5, Table X--2, p. 276], (3) Anderson 


and Bancroft [1, Table 23.2, p. 348], (4) Rider[8, Exercise 4, p. 191], taken from Yates [11], (5) Cochran 
and Cox [4, Table 7.5, p. 225]. 


square to the smaller and the upper 10 percent and 25 percent points 
of the F distribution. If we use a preliminary test of size a = .50, 
we conclude that B X Rand A X B X R should not be pooled for the 
data from sources (1), (3), and (4). Even if we take a = .20, we decide 
not to pool in the case of the data from source (3). Let us now look 
at this example in greater detail. The data represent yields (in bushels 
per acre) of corn in an experiment conducted to compare four methods 
of seedbed preparation and four methods of planting, using four blocks. 
The methods of seedbed preparation are the whole-plot treatments A, 
the planting methods are the subplot treatments B, and the blocks 
are the replications R. Since there are two fixed factors (A and B) 


TABLE 3 

ANALYSIS OF VARIANCE FOR Data FROM SouRcE (3) ABOVE 
Source of Variation d.f 8.8. M.S. F 
Seedbed methods (A) 3 194.56 64.85 64.85/17.58 = 3.69 
Replications (R) 3 223.81 74.60 
AXR 9 158.25 17.58 
Planting methods (B) 3 | 4107.39 | 1369.13 | 1369.13/28.08 = 48.76** 
AXB : 9 221.74 24.64 24.64/13.18 = 1.87 
BXR 9 252.72 28.08 
AXBXR 27 355.75 13.18 
Total 63 | 5514.22 _ 


**Significant at the one percent level. 
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and only one random factor (2), the fixed main effecis and interaction 
are tested by their interactions with the random factor, as shown in 
Table 3. Anderson and Bancroft give the correct test for the whole- 
plot treatments A, but test the subplot treatments B and the inter- 
action A X B by the subplot error obtained by pooling 4 x # and 
A X BX R. In this case they reach the correct conclusions (B highiy 


significant, A X B non-significant), but clearly this will not always 
be true. 


4. Strip Plots and Sub-subplots. 


The expected mean squares are different for strip-plot experiments 
than for ordinary split-plot experiments, but the same tests are ap- 
propriate (see, for example [4], [6]), and the same preliminary test for 
pooling can be used. The theory can also be extended in an obvious 
way to split-split-plot experiments. If it cannot be assumed thet the 
whole-plot treatment A, the subplot treatment B, and the sub-subplot 
treatment C do not interact with replications, the sub-subplot error 
should be partitioned into four parts C x R, AXCXR,BXKCXR 
and A X B X C X R. In the most common case (A, B and C fixed, 
and R random), these are appropriate for testing C, A x C, B xX C 
and A X B X C respectively. They should be pooled only if this is 
indicated by the results of appropriate preliminary tests of significance. 
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156 NOTE: Confidence Intervals for Recombination Experiments 
with Microorganisms 


A. W. 
Mathematics Panel and Biology Division, 
Oak Ridge National Laboratory,* 
Oak Ridge, Tennessee, U.S.A. 


1. METHODOLOGY 


In a typical recombination experiment, a parent bearing a bio- 
chemical mutation at the A locus is mated with a parent bearing a 
biochemical mutation at the B locus, both loci on the same chromosome. 
The two parental types are thus (a+) and (+b). The loci (or markers) 
are usually identified with nutrient elements such as adenine or lysine, 
and a mutation at one of these loci signifies that the organism carrying 
the mutation will not grow unless the corresponding nutrient element 
is present in the growth medium. Progeny from such matings are of 
four types. If no recombination takes place on the chromosome between 
the loci, progeny must be of the parental types, and the two classes 
have equal expected frequencies. If a single recombination takes place 
in this region, the progeny will be (++) or (ab) and these types like- 
wise have equal expected frequencies. Types (++) and (ab) are called 
recombinants and types (a+) and (+5) are called non-recombinants. 
Clearly the frequency of occurrence of recombinants is dependent on 
the distance between the loci, assuming that recombination is equally 
likely to occur anywhere along the chromosome, and obtaining informa- 
tion about distances between loci is one of the main purposes of a 
recombination experiment. The proportion of recombinants is called 
the recombination frequency and the proportion of the (++) type 
is called the prototroph frequency. 

Since, ordinarily, the four types of progeny cannot be separately 
identified, platings are made both on complete media (containing ele- 
ments A and B) and on minimal media (A and B omitted). All four 


1Present address, Department of Biostatistics, The Johns Hopkins University. 
*Operated by Union Carbide Corporation for the U. S. Atomic Energy Commission. 
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types will grow on complete media but only the prototroph (++) 
will grow on minimal media. Thus, if a count of z organisms is found 
by plating v, ml of a suspension on a minimal medium and a count of 
y organisms by plating v, ml on a complete medium, the prototroph 
frequency is estimated by 


R = o,2/v,y. 


If the total count, S = x + y, is large and neither z nor y is too small, 
a confidence interval can be computed from the formulas, 


R, =»,(T — A),1+ 4), Ry 4) 


where R, and Ry , respectively, are the lower and upper confidence 
limits, T = 2/y, A = t,(T/S)’, and where ¢, is the normal deviate 
corresponding to a 100(1 — a) percent confidence interval. 

As a numerical example, consider an experiment that yielded the 
values v, = 180, = 561, v, = 2.5, y = 3759 for which 95 percent 
confidence limits (¢.5; = 1.96) are required. We have 


__(2.5)(561) _ 
= (80)(3759) ~ ‘002078 


T = 561/3759 = .14924, S = 561 + 3759 = 4320, 
A = 1.96(.14924/4320)'” = .01152, 


(2.5)(.14924 — .01152) _ 
+ 


2.5(.14924 + .01152) _ 
Ru = — 01162) ~ -202258. 


These limits are not, in general, symmetrical about the estimate. 

In many cases, the experimenter will have preliminary information 
about the expected orders of magnitude of the ratios z/v, and y/v, 
and may want to know how large an experiment is required to yield 
a reasonably precise estimate of the prototroph frequency. One measure 
of the precision is the relative width of the confidence interval, 


W= (Ry = R,)/R. 


Ordinarily, one is interested in relative widths of not more than 30 
percent (W < .30), and in this case the relation, 


S = + (2) 


holds approximately. Both the total count S and the ratio of the 
counts 7’, are important in determining the precision of the estimate, 
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and it can be shown that if the counts are in a 1 : 1 ratio, the total 
count required for a specified relative interval width will be a minimum. 
With this in mind, if we put 7 = 1 in (2), the total count required 
is given by 


S = 16¢,/W’. (3) 


Thus the procedure is to decide on the confidence level and interval 
width desired and to calculate the total count from (3). Then, knowing 
that the counts should be in a 1 : 1 ratio and having preliminary in- 
formation about z/v, and y/v, , the amounts to be plated out can be 
calculated directly. If the amounts calculated turn out to be ex- 
perimentally unfeasible, one might have to depart from a 1 : 1 ratio 
for the counts, in which case, the total count can be calculated from (2) 
for whatever value of T is found to be acceptable. Some trial and 
error with the use of (2) may be necessary in these cascs. 

As an example, consider a situation in which it is expected that 
the counts from a standard suspension will be about 2 per mi for x 
and 5,000 per ml for y. A relative width of about 2U percent is desired 
for a 95 percent confidence interval. Thus ¢, = 1.96, W = .20 and 


S = 16(1.96)’/(.20)? = 1540, 


whereupon the amounts to be plated should be chosen sv that approxi- 
mately 770 counts will be obtained from the platings on each mediurn. 
In other words, », = 770/2 = 385 and v, = 770/5,000 = 0.15 are 
the amounts to be plated on minimal and complete media, respectively. 

Although sample sizes determined by this method may be used as 
a rough guide in designing recombination experiments, they should 
not be regarded as optimum. Readers interested in statistically more 
sophisticated procedures for determining sample sizes should refer to 
Birnbaum [1954]. 


2. DERIVATION 


Since v, and v, are known constants, the statistical problem consists 
of finding a confidence interval for the parameter 6 = y/v, where 
uw = E(z) and vy = E(y). Assuming that z and y are independently 
distributed Poisson variables, Birnbaum [1954] showed that a confidence 
interval for @ may be constructed by treating p = y/(x + y) asa 
binomial variable with E(p | x + y) = x = v/u + ». Limits on z, 
say (7 , 7,), are easily obtainable from published tables of the binomial 
distribution, whereupon limits on @ are given by 


= (1/m) —1 and 4 = (1/10) 
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In many cases, however, z + y is large and the normal approximation 
to the binomial distribution may be used to calculate the limits on z, 
providing x is not near zero or one. This in turn leads to the limits 
given in (1). 

The relative width of the confidence interval for the prototroph 
frequency can be written as 


W =W,(1+ — (W/4)] 


where W, = (x, — 2)/p, the relative interval width for x. In most 
practical situations W, will be small, say < .30, in which case W > 
W,[1 + (1/T)]. When the substitution is made for W, , the result 
can be expressed as in (2). For fixed W, this expression has a minimum 
at 7 = 1, and (8) follows directly. I am indebted to the referee for 
pointing out that if W < .30, W, will be about half as large as W 
when 7’ = 1, and the approximation will be even better than indicated. 
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157 NOTE: Estimation of Proportion from Zero-truncated Binomial 
Data 


G. N. WILKINSON 
Division of Mathematical Statistics, 
Commonwealth Scientific and Industrial Research Organization, 
University of Adelaide, South Australia 


INTRODUCTION 


An important problem of estimation arises, chiefly in genetical con- 
texts, when the proportion of individuals bearing some particular 
character must be estimated from the frequencies observed in a random 
sample of groups or families which have at least one individual bearing 
this character. The estimation by maximum likelihood of the proportion 
p in this situation has been discussed principally by Haldane [5], [6], 
Fisher [3] and Finney [2]. 

In conjunction with D. G. Catcheside’s studies on tetraploid maize [1] 
the author derived an iterative procedure, which is presented here, with 
a simpler mathematical form and rather better convergence properties 
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than the standard method based on score and information, which is 
described by Finney [2]." 

It was found preferable to estimate, instead of p, a parameter M 
related to p by the equation X/M = p, where X is the total number of 
character-bearers in the sample. The parameter M may be thought 
of as the total number of individuals that might have been observed 
if families with no character-bearers had also been ascertained in the 
sample. In this respect the method described here is a special case of 
the general method given by Hartley [7], in which the application of 
maximum likelihood to data with missing (or coalesced) ‘classes is 
facilitated by estimation of the relevant frequencies for those classes. 
A similar approach has also been used by Fyfe [4]. 

The formula given below, however, leads to less computation than 
Hartley’s procedure, when several different sizes of family are involved. 
In the present instance accelerated convergence has been obtained by 
applying Newton’s method of approach by tangents to the sclution 
of the equation M = M*(M), and requires, for each cycle of computa- 
tion, only a simple iteration in this equation. Hartley, on the other 
hand, employs the “approach to the geometric limit’”’, which differs 
from Newton’s method in that approximating chords are used in place 
of tangents, and thus requires two basic iterations in each cycle. 


THE ITERATION FORMULA 


The basic sampling distribution is the zero-truncated binomial. 
Accordingly, the likelihood function for the data, using the notation 
given in Table 1, is essentially 


L = X logp + (N — X) logg — aE log (1 — q'). (1) 
If M = X/p, m* = n,/(1 — q*), and M* = >> m* , the differential 
coefficient of the likelihood function with respect to M, may be ex- 
pressed in the form 


— 
(2) 


1In Finney’s paper it is suggested, with reference to the tables of scoring functions provided, that 
linear interpolation may be used to obtain more accurate approximations to the maximum likelihood 
value #. Asa matter of principle it may be noted that linear interpolation is insufficient for this 
purpose. Linear interpolation, between the entries for p: and p: say, will determine an estimate 


intermediate to the two estimates based on pi and ps respectively, but generally the latter estimates 
will both lie on the same side of the maximum likelihood value #, even when »: and ps are themselves 
on opposite sides of #. More accurate interpolation, based on second differences as well, is therefore 
necessary. 
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The factor p/Mq is essentially non-zero. Application of Newton’s 
method, in which the successive approximations to the zero of f(z) 
are given by the formula 


Teor = — 
to the factor (M — M*) gives the formula 


= (Mt = r,.M,)/(1 = (3) 
where 
M q 222 f 
TABLE 1 
NOTATION 
Number of Number of 
Size of group, s Frequency individuals character-bearers 

2 fe n2 = 2fe Xe 
3 fs = 3fs 73 
Total F N xX 


Estimation is completed by determining # as the proportion of X 
to M, and the amount of information: in relation to p may then be 
determined as 


i,= (1-71), (4) 


evaluated at the point of maximum likelihood. The amount of infor- 
mation differs from that for simple binomial data by the factor (1 — r). 
The factor r thus emerges as the relative amount of information lost by 
truncation in the sample. This factor governs the rate of convergence 
of the iterative process, convergence being extremely rapid when r is 
small. Even when the loss of information is as high as 40 percent, 
however, as in the example below, a single cycle of computation, from 
a suitable starting point, will give a high degree of accuracy. 

A very good starting point is provided by the (iterative) solution 
of the much simpler equation 


M (5) 


Se 
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where & is the mean size of families observed, or the equivalent equation 
for the Poisson distribution (if p is small), as used by Fyfe [4]. Indeed 
for many practical purposes the solution of (5) will itself be sufficiently 
accurate. In the example given below, in spite of the wide range of 
family sizes, the solution of (5), roughly M = 3,300, is within 3 percent 
of the correct maximum likelihood value, and, in the case of the data 


on human albinism discussed by Haldane [6], the relative error is 4 
percent. 


EXAMPLE 


Catcheside [1] gives the numbers of recessive su seeds observed on 
24 cobs from tetraploid maize triplex for su—1* and pollinated by 
nulliplex. It was possible that the parent plants of one or more of the 
seven cobs observed with no su seeds might have been quadrupiex 


TABLE 2 
IreRaTIVE SOLUTION OF THE Equation M = M*(M) 

18 1 7935 87.2 . 83749 110.76 
65 2 4338 114.8 .52707 137.44 
74 1 3864 120.6 -48235 142.95 
76 1 3766 121.9 .47294 144.20 
84 2 3399 127.2 .43709 149.22 
86 2 3312 128.6 .42856 150.50 

131 2 1858 160.9 . 27508 180.71 
140 3 1655 167.8 . 25174 187.10 
173 2 1083 194.0 . 18187 211.46 
181 2 0977 200.6 . 16808 217.57 
188 2 .0893 206.4 . 15688 222.98 
206 1 .0709 221.7 13139 237 .16 
211 1 .0665 226.0 12507 241.16 
219 2 .0600 233.0 11559 247 .62 
223 0570 236.5 .11112 250.88 
251 3 0398 261.4 .08433 274.12 
259 2 0359 268.6 .07794 280 . 89 
N = 2585 X = 33 M® = 3077.2 M*¥ = 3386.72 
S = (m* — n,)/fe So = 73,842 Si = 142,276 
(all fe = . F = 17) To = 0.3694 "1 = 0.41850 
1 — ro = 0.6306 1 —r, = 0.58150 
Me = 2585 M, = 3,366 M; = 3,401.63 
po = 0.012766 tri = 0.009804 p: = 0.009701 
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rather than triplex. As a check, therefore, p was estimated from the 
17 cobs with one or more su seeds. The iterative calculations for these 
data are set out in Table 2, and give an estimate M = 3402 in very close 
agreement with the actual number of seeds observed, 3359, on all 24 cobs. 
The sensitivity of the data on this point is rather low, however, as the 
relative standard error of M (and #), determined by formula (4), is 
23 percent. 

To exhibit the convergence properties, and for comparative purposes, 
M, = N = 2585 has been chosen as the starting point for the co: 
putations in Table 2. In practice the value 3359 might have been 
chosen, or, lacking this information, the approximate solution 3300 of 
the simpler equation (5). A single iteration would then have given far 
more accuracy than actually required. The values of g* were determined 
in this instance by logarithmic calculation, but this calculation can be 
avoided if suitable binomial tables are available. The values r, are 
determined essentially by the computation of a weighted sum of pro- 
ducts of the m* with the differences mt — n, (not shown). ~~. 

To compare rates of convergence ths successive approximations to 
the solution of dL/dp = (M — M*)/q, by adjustment in terms of score 
and information, were also determined. The comparison is shown in 
Table 3. It can be seen that the present method does very much better 
from a poor start than the other, requiring only one iteration instead 
of two. However the disparity diminishes, as one would expect, in 
the neighbourhood of the maximum likelihood value. 


TABLE 3 
Successive APPROXIMATIONS TO M = 3401.76 By APPLICAT NewTon’s 
MertuHop THE Equations M = M*(M), (ii) /dp = 0. 
ERRORS ARE SHOWN 


Me M, 


M = M* 2585 | 3366(1%) | 3401.63 (0.004%) 3401.76 (22) 
dL/dp =0 | 2885 | 3697(10%) | 3421.19(0.6%) 3402 .04'(0.008%) 


COMMENT 


The approach which led to the present meth«.. was somewhat dif- 
ferent from Hartley’s and may be of :nterest in c’, » connections. In 
the simplest cases the differential of che likelihood fuxction with respect 
to a parameter (or some related parameter), 9 sey, can be factorized 
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into two parts, one linear in 6 and the other essentially non-zero. More 
generally, 


dL/dé = g(l(@, x)).h(@, x) 


where 1(6, xz) is linear in 6, g(0) = 0, and A(6, z) is essentially nonzero. 
In such cases one has the explicit solution given by the zero of 1(@, x) 
and no iteration is necessary. It was argued that, in the less simple 
cases, a similar factorization might be sought, in which 1(@, x), though 
not strictly linear, was in some sense asymptotically linear. In the 
present instance, for example, the function M — M* is asymptotically 
linear in M as ¢ = X/F increases. A sketch graph of this function is 
shown in Figure 1. Curvature of the function is restricted by a tangent 


FIGURE 1 
Sxetcn GRAPH OF THE Function M — M*(M) 


(The intercept X(N — F)/(X — F) corresponds to Haldane’s estimate of p, [6], from data in which 
the probability of ascertaining a family is proportional to the number of affected individuals in the 
family.) 


and an asymptote which differ in slope only by the factor 1/z. Generally, 
it could be expected that iterative methods applied to 1(@, x) rather 
than to dL/d@ would give more rapid convergence, as the functions 
g and A in the latter will generally have an adverse effect on convergence 
by increasing curvature. The comparison is complicated, however, by 
the presence of one or more inflexion points in dL/dé. 
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7 GUMBEL, E. J. Statistics of Extremes. New York: Columbia University 
Press, 1958. Pp. xx + 375. $15.00. 


B. F. Kimpaui, New York Public Service Commission, New York, N. Y., U.S.A. 


This work is essentially a source book on the statistics of extreme values. 
Furthermore it is the only source which is comprehensive. Viewed as a source book 
it is well arranged and well documented. The bibliography includes over 600 books 
and articles, to which specific references are made throughout the book. Of these, 
34 are papers by the author, or written in collaboration. 

The first chapter deals with definitions and the basic concepts. In the second 
chapter, on order statistics, the theory of exceedances is set forth and handled in a 
straight-forward and concise manner. 

In Chapter 3 the exact distribution of the extreme values of a sample are intro- 
duced. Chapter 4 is given over to an analysis of the exact distributions of extremes, 
pointing out the essentially different types which are important. 

These first four chapters comprise nearly half the subject matter and include 
all the material dealing with “exact’’ distributions of extreme sample values. Per- 
haps the principal advances in the study of the distribution of extremes in the last 
twenty five years have been brought about by recognition that the asymptotic 
forms of these distributions are usually adequate as approximations to the exact 
distributions in the applied field, and are much simpler to deal with. 

The asymptotic forms fall naturally into three classes, of which the doubly 
exponential is of the most consequence since it is the asymptotic form which results 
from a universe having an “exponential” type of cumulative distribution function, 
such as the logistic, normal and Gamma distributions. A second class, or ‘“‘second 
asymptote” as referred to by Gumbel, stems from distributions of the Cauchy or 
Pareto type. The third class is often referred to as Weibull’s distribution when 
used for the lower extreme. Both can be obtained from the first by suitable loga- 
rithmic transformations. The first ‘asymptote’ and its applications are discussed 
in Chapters 5 and 6, covering about 100 pages. This class is of particular interest 
to biologists—for example, in studying extinction times of bacteria'. The second 
and third asymptotes are treated in Chapter 7. Chapter 8 is devoted to the asymp- 
totic forms of the range and mid-range, and their applications. 


\Statistical Theory of Extreme Values and Some Practical Applications, U. 8. Dept. of Commerce 
N. B. S. Applied Math. Series 33(1954) p. 43; U. S. Gov. Printing Office, Washington, D. C. 40c. 
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This reviewer finds the author very ingenious in deriving methods which he 
believes will facilitate application of the theory to practical problems. Some of 
these ingenious devices are open to criticism. For example, the discussion of the 
topic Fitting Straight Lines on Probability Papers (pp. 34-36) is more ingenious 
than logical. 

’ A formula is developed at the bottom of page 23 for the probability that a 
return period 7’ will be reached in »v trials. This probability is ‘distribution free”’ 
and for large 7 can be approximated by W(v) = 1 — exp (—v/T). From this 
Gumbel develops what he calls a ‘‘control interval”. 

T/X <v < T,X > 0, the probability that v will lie on this interval being given 
by P =e"'” —e-. This is ingenious, but, since the probability density function 
is monotonically decreasing from v = 0, the shortest interval carrying the same 
probability would be 0 < v < Td’ with P = 1 — e~*’. Hence, for P = .6827,’ = 
1.148 and the “control interval” would be 0 < v < 1.1487 rather than (.3196)T < 
v < (3.129)T. 

A similar criticism applies to the control intervals (5) on page 215. These 
intervals are centered at the mode. The “shortest” such interval for P = .6827 
would be given by A = —.917, A = +1.317, rather than by A = +1.141. This 
would change (8) to T(y + A) = 3.732T(y) — 1.366, T(y — A) = 0.4007T(y) + 0.300. 
Likewise for P = .9545, A’ = +3.0069 becomes A’ = —1.59 and 3.251. Thus, 
when the underlying distribution is asymmetrical, some adjustment should be 
made when possible. 

In presenting The Extreme Control Band of Table 6.1.6, p. 218, and in other 
references to the control band, besides noting the lopsided character of the ‘‘shortest 
interval” of fixed probability, the author might have pointed out that most of 
these formulae presume a correct estimate of the scale parameter a. As I pointed 
out in my contribution to the book, the uncertainty of the estimate of a will involve 
the variance as a function of y, the distance from the mode, and N, the size of the 
sample used in determining a [cf. (2) p. 235]. Thus the complete variance upon 
which a confidence band should be based will involve both the ‘‘variance of position” 
of the fitted line and the intrinsic variance of the order statistic (used as a basis 
for Table 6.1.6.)*. 

Other formulas for the ‘‘variance of position” are casually referred to, such 
as the “Control Curve of Dick and Darwin”, Section 6.1.7; and formula (5), p. 
228, which gives the sampling variance of an estimate x obtained by the method 
of moments. In using this latter formula it should be made clear that the o* in 
the denominator refers to the variance of the sampled values and hence might be 
replaced by s*. The formula (6) which follows it would appear to be in error. With 
(x*/6)/a? substituted for o? and o%{y) = a*o*(z) the a* should cancel out. Also 
the significance of (6) is not clear since y is usually assigned a value in making an 
estimate from (4) by the method of moments. 

The matter of the identification of the variance of the order statistic vis 4 vis 
that of the sampling variance of the estimate (variance of position) needs to be 
made clearer. 

I have noticed only two rather obvious typographical errors: On p. 24, line 6, 
.4637 should be replaced by .04657, and on p. 215, line 1, .05450 should be replaced 
by .95450. 


*Henry Schultz, The standard error of a forecast from a curve. J. Amer. Stat, Asso. $5 (1930), 
139-85. 
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VESSEREAU, A. Méthodes Statistiques en Biologie et en Agronomie, Tome 
8 deuxiéme. Paris: J.-B. Bailliére et Fils, Editeurs. Nouvelle edition entiérement 
refondue, 1960. Pp. 540. 55 NF. 


P. Roprnson, Canada Department of Agriculture, Otiawa, Ontario, Canada. 


This book, as the title implies, is intended principally for research workers in 
the biological sciences. There are now several good texts available in English, 
but there are few in the French language. In his introduction to the first edition 
the author says “Le pfesent ouvrage se propose de combler cette lacune.” That 
there was this need may perhaps be judged by the short time elapsing between 
the first and second editions. The second edition has been completely revised and 
now includes— 

(i) a fuller discussion of the underlying mathematical reasoning, 

(ii) a more general description of the analysis of variance. 

(iii) a more extensive treatment of experimental design, including confounding, 

and 

(iv) tables and graphs relating to the binomial and Poisson distributions. 


This book contains a great deal of useful information and the mathematical 
treatment is kept simple throughout. Basic ideas on probability and sampling 
are discussed in the first four chapters, simple tests of significance in the next two 
chapters, experimental design in the next nine, and regression and correlation in 
the last two. 

In tovering such a wide field within the confines of a single book, any treatment 
in depth is of course precluded. The broad approach to the subject may have led 
the author to include descriptions of higher order moments, the 8; and 6; measures 
of skewness and kurtosis, and Sheppard’s cqrrections. These, however, are com- 
pletely unnecessary in an introductory text of this nature intended for the biologist, 
as is evidenced by the fact that they do not reappear after Chapter IT. It is also a 
pity that outdated measures of dispersion—the probable error (pp. 56 and 100) 
and the mean error (p. 100)—are discussed. These are interesting from a historical 
viewpoint, but only serve to confuse the agricultural worker. 

It would have been better to sacrifice breadth of approach in these areas in 
order to deal a little more fully with problems of more direct concern to the biologist. 
In particular the sections on transformations, split plots, and the combined analyses 
of repeated experiments could have been expanded. A discussion on expected mean 
squares might have helped when dealing with this last problem; the correct choice 
of the appropriate error for testing a particular hypothesis would then have been 
clearer, instead of appearing to be left to the whim of the experimenter (p. 419). 

A fuller discussion of the difficulties involved in multiple comparisons would 
also have been helpful. Comnignts (2) and (3) on pp. 187 and 188, for example, 
appear to be contradictory, and the discussions on pp. 203, 204, 206 and 216 could 
be misleading. 

The use of artificial data in certain illustrations (pp. 193, 458) is unwarranted 
when so many examples are now available. This is particularly dangerous when 
dealing with the analysis of covariance because the interpretation can be tricky. 
The author’s suggestion of circumstances in which as covariance analysis could be 
used—p. 451—‘“Lorsqu’ on étudie |’influence des engrais sur |’indice de pureté du 
sucre extrait d’une culture de betterave sucriére, les résultats dépendent du poids 


4 
4 
in 
4 


BOOK REVIEWS | 163 


des racines, qui n’est pas constant dans les différentes parcelles de |’essai, mais qui 
peut étre mesuré aprés coup”—is a particular case in point. 

Finally I must disagree with a piece of advice which the author gives in his 
“En guise de conclusion.” He suggests, if there is any* doubt in the mind of the 
research worker as to the applicability of some of the techniques discussed, that 
“mieux vant encore s’en tenir aux schémas éprouvés.”” This value of this advice 
depends entirely upon what is meant by ‘‘schémas éprouvés’”’. In general, it would 
be far better to advise the research worker ‘‘de demander conggeil aupres des experts 
en statistiques.” 


LOVE, A. G., HAMILTON, E. L. and HELLMAN, I.L. Tabulating Equipment 
9 and Army Medical Statistics, Washington, D. C.: Office of the Surgeon-General, 
U. S. Army Medical Service, 1958. Pp. x + 202. 36 Figures. $2.00. 


W. W. Houtanp, London School of Hygiene and Tropical Medicine, London, 
Engiand 


This is a delightfully written account of the development of machine methods 
and of the techniques of recording in the American Army. The book not only 
traces the development of statistical services, but also shows how great the de- 
pendence of an efficient military medical service is on the collection, tabulation and 
assessment of data. After reading this short monograph it is easy to understand 
how the excellent description of such diseases as cholera, respiratory infection and 
of injuries sustained by American Army personnel during the two World Wars 
could be completed. It is of interest to read that, even in the first years of the 
existence of the American Army, during the Napoleonic Wars, acute respiratory 
disease was s. severe problem. Although the treatment tends, at times, to be some- 
what too bidgraphical, a good account of the development of punch cards, sorting 
and tabulating systems and of computers is given. Full details and illustrations 
of the various forma used for recording data are described as well as an accountgpf 
the collection and analysis of anthropometric information obtained during routine 
assessment of personnel on enlistment and discharge. 
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ABSTRACTS 


The following are abstracts of papers presented at the annual meeting of the Biometric 
Society EN AR, held jointly with the Biometric Society WN AR, the American Statistical 
Association, and the Institute of Mathematical Statistics in Stanford, California on 
August 23 to 26, 1960. Abstracts of additional papers presented at this meeting appear 

in other society publications. 


KLAUS ABT (U. 8S. Naval Weapons Laboratory, Dahlgren, Va.). Analysis 
of Variance of Differences Versus Analysis of Covariance. 


In analyzing paired data (z; y), classified in one or more directions, where 
z and y are observations of the same variable (and therefore usually observed at 
different times) at least two methods are available. First there is the analysis of 
covariance in which y is considered to be stochastically dependent on z, the “‘in- 
dependent” variable. A second method is the analysis of variance of the generalized 
differences, d = y — fot, briefly called ‘‘analysis of differences’. The case Bo = 1 
represents the most important case of ordinary differences. 

It is shown that, if the condition E(bg) = {Ao is satisfied (“expectation of the 
coefficient of error sum of squares regression line equals 6’, which easily can be 
tested), the analysis of differences has a power equal to or even higher than that 
of the analysis of covariance. From many calculated examples, it seems that the 
condition mentioned is satisfied quite often. In many cases, therefore, the analysis 
of differences will be the appropriate method for the data to be analyzed, besides 
this including a minimum of computation. 


R. L. ANDERSON (North Carolina State College, Raleigh, N. C.). Recent 
Developments in Multivariate Analysis. 


One of the most important recent developments in multivariate analysis has 
been the publication of some excellent textbooks on the subject, on both theoretical 
and applied aspects. This paper is in the nature of a discussion of what the writer 
feels is needed to improve multivariate techniques. The chief need is much more 
attention being paid to construction of experimental models, usually involving two 
or more equations, and methods of estimating parameters in these models, assessing 
their usefulness and testing hypotheses concerning them. Particular attention is 
paid to causal chains and to dynamic systems. For the latter, methods based on 
lagged dependency models and Laplace transform techniques are indicated. : 

. Some possible weaknesses of current multivariate analysis of variance procedures 
are indicated. Problems which should be studied in the following fields are outlined: 
discriminatory, canonical, factor and component analysis. Special attention is 
focused on the construction of selection indexes, and the selection of predictors in 
ordinary multiple regression analyses. 
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685 ROLF E. BARGMANN (Virginia Polytechnic Institute, Blacksburg, Va.). 
On the Problem of Ordering Variables in Tracing Significant Contributions. 


After rejection of some general linear hypothesis in multivariate analysis it is 
frequently desirable to ascertain how much each variable contributes to the ob- 
served differences. The step-down procedure—i.e., a process analogous to analysis 
‘of covariance in which we study the first variable, the second given the first, the 
third given the first and second, etec-—depends on the ordering of the variables. 
The present report discusses the use for this purpose of the correlations of a linear 
discriminant function a’X, where a’ is the eigenvector associated with the largest 
root of HE“! (H = matrix of 8.S. and S.P. due to hypothesis, EH same due to error). 
A similar procedure is described for the discrimination of common factors, i.e., for 


those cases where any difference between groups is supposed to be in terms of those’ 


artificial components only which are shared by two or more of the observable vari- 
ables. A detailed demonstration study will be presented for illustration. 


EDWARD C. BRYANT and DONALD W. KING (University of Wyoming, 
686 Laramie, Wy.). Estimation from Populations Identified by Overlapping 
Sampling Frames. 


Consider a population of unknown size identified by p sampling lists, each with 
M; entires from which independent samples of size m; , i = 1, 2, --- , p may be 
drawn. It is assumed that the sampling procedure will reveal which of the p lists 
contain the individuals interviewed. An estimate of the total number in the popula- 
tion is desired, as well as the number of individuals appearing on each possible 
combination of sampling lists. Maximum likelihood and minimum modified chi 
square estimates are considered. By moderate restrictions, an approximation to 
the variance is provided by use of the Taylor’s series expansion around the expected 
numbers in each category of listing combinations. 


687 BRADLEY E. COPELAND (New England Deaconess Hospital, Boston, 
Mass.). Problem in Pathology With Statistical Aspects. 


The pathologist in his role in diagnostic laboratory medicine finds a compelling 
need for a broader comprehension of the fundamental principles of statistics for 
himself and on the part of the practicing physician, the medical taehnologist, and 
the medical research worker. 

Every physician must be equipped with certain statistical tools to properly 
utilize the laboratory measurements now available to him for diagnosis and treat- 
ment. 

There must be clear and rapid communication between the pathologist and the 
clinician concerning the variation in normal values and the reliability and inherent 
variability of laboratory measurements. 

The first major problem relates to the barrier of statistical nomenclature and the 
multiplicity of statistical formulas. Communication cannot be successful until 
pathologists and clinicians learn to handle with ease such fundamental statistical 
concepts as standard deviation, standard error, variance, the ¢ test, and the F ratio. 

Since medicine is concerned with the individual, there is a need to clearly 
identify the tools which separate the diseased individual from a normal population 
as opposed to the tools which differentiate normal and abnormal populations. 
Both are useful devices, but they are the source of much confusion in the minds 
of physicians. 
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Currently there is an also pressing need for more comprehensive definitions of 
normal values in all laboratory measurementse—normal as regards variation in age, 
sex, geographical location, and any other environmental or hereditary factor which 
may influence the normal population. 

To achieve permanent indoctrination of the medical profession with useful 
statistical tools is a necessary and useful goal. It is in this area that active communi- 
cation between pathologists and statisticians should take place to develop solutions 
to the problems described. 


6sg R- C. ELSTON (University of North Carolina, Chapel Hill, N. C.). On 
Additivity in the Analysis of Variance. 


The problems of testing for non-additivity and of finding the best transformation 
of the data to minimize it are discussed. Consider the model: E(ye) = 4 + be + Cus» 
where y, is an observation belonging to the g-th subclass in an analysis of variance 
problem. If this model fits our data, and if max pg — min wg < 1/| c | (the range 
over which this model is a strictly monotonic function of u,), then there exists a 
useful transformation of our data that will give additivity; that is the transformed 
data will be fitted by an additive modél. Two tests that have already been pro- 
posed for testing for non-additivity are tests of the null hypothesis c = 0 against 
c ~* O under models that can be considered as approximations to the above model. 
It is shown how estimates of » and c can be used as guides in finding an appropriate 
transformation of the data. 


689 WALTER T. FEDERER (Cornell University, Ithaca, N. Y.). Augmented 
Resigns with Two-, Three- and Higher-Way Elimination of Heterogeneity. 


Augmented experimental designs are standard ones to which additional treat- 
ments have been added. The additional treatments may or may not be replicated 
the same number of times as the treatments in the standard design or the same 
number of times as other new treatments. The groupings of treatments into rows, 
columns, complete blocks, ete. may be such that the size of the group is or is not 
equal. Construction procedures, randomization procedures and the analyses in 
general form have been presented for augmented experimental designs with two-, 
three- and higher-way elimination of heterogeneity. 


600 WILLIAM R. GAFFEY (California State Department of Public Health). 
Tests of Hypotheses Concerning Boundedness in Convolutions. 


Suppose the random variable Y, with unknown d.f. G(y), is observed subject 
to error, so that X = Y + Z is the random variable actually observed. Let H,(z) 
be the (known) conditional d.f. of the continuous “error” random variable Z, and 
let F(X) be the df. of X. We are concerned with deriving tests of the following 
two-hypotheses about the distribution of the “true” random variable Y ;(1) G(y) = 0 
for y < a, and (2) Gy) = 1fory > b. 

Under suitable restrictions on the form of H,(z), it turns out that hypothesis 
(1) is true if and only if F(X) < H.(z — a) for z < a, and hypothesis (2) is true 
if and only if F(X) > Hz — b)forz >». A test of either inequality is therefore 
a test of the corresponding hypothesis about G(y). 


A ‘test is proposed using the one sided Kolmogorov-Smirnov etatistic, restricted 
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to the appropriate portion of the range of X. The large sample distribution of the 
statistic is derived, and the consistency of the test is established. 


MAX HALPERIN (General Electric Co., Schenectady, N. Y.). Nearly 
Least Squares Estimates in Heteroscedastic Regression. 


Suppose we have observations y;;, = 1, kj j = 1, , 
where Ey;; = a + ba; and Var y;; = oi , the z; are presumed known, and the yi; 
are normally distributed, mutually independent, and k > 2, minn; > k. In this 
paper we show how to generate estimates of a and b which are nearly least squares 
in the sense that the correlation of the estimates is identical to that which would 
be obtained from a least squares fit with known variances while the variance of the 
estimates differs from that appropriate to the known variance least squares fit by 
a multiplicative factor which approaches unity as min; n; — © and k remains con- 
stant. The procedure further allows estimation of the covariance matrix of the 
estimates with (min; n; — k + 1) degrees of freedom and hence an exact confidence 
region for a and b based on Hotelling’s T? distribution. The results cited are of a 
conditional nature in which the conditioning constrains the variance multiplying 
factor referred to above. The distribution of the multiplicative factor aside from 
known constants is that of Hotelling’s central 7? in general (Student’s ‘‘t” if k = 3). 
As a consequence, alternative exact confidence intervals of unconditioned nature 
are available in principle by averaging the conditional results over the appropriate 
distribution. It is conjectured that matching moments to determine approximate 
degrees of freedom for an “‘equivalent’’ T? distribution is adequate from a practical 
point of view. If k = 2, the procedure we propose gives estimates with exactly 
least squares variances and exact confidence intervals or regions. It is conjectured 
that results similar to the above generalize to general polynomial or polyvariable 
regression linear in the parameters and a sketch of the type of argument is given. 
692 R. C. HENNEMUTH (Inter-American Tropical Tuna Commission). Esti- 

mating Vital Statistics of Yellowfin Tuna Populations. 


Knowledge of vital statistics is essential to the study of the dynamics of ex- 
ploited fish populations; age composition, growth rate and mortality rates being 
particularly important. 

Determination of age of yellowfin tuna by using marks on scales or bones has 
not proven reliable; however, growth rate and relative age have been estimated 
by observing the temporal changes in the size composition of catch. Absolute age 
has been approximately determined by comparing the average time of spawning 
with the time and size at which the age groups initially appear in the catch. 

Total mortality rate has been estimated by analyzing catch-curves of successive 
year classes, which have been computed from age composition and indices of apparent 
abundance based on catch per unit effort data. 


BURTON J. HOYLE and GEORGE A. BAKER (University of California, 
Davis, Cal.). Game Theory Applied to Field Trials. 


The determination of the proper variety of a cereal to grow may be considered 
aS a game with the Grower as one opponent and Nature as the other. A game 
matrix of forty-one independent yield trials on nine strains of Hannchen barley is 
presented. The trials cover three years and very diverse designs and environments 
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within the years. A similar but more restricted matrix for kernel weight is also 
presented. The whole structure of field trials is made clear by a contemplation of 
these game matrices. Sometimes one strain does better and sometimes another. 
That is, it is doubtful that there is a best strain. Sometimes all strains are close 
together and sometimes the strains differ widely. Some strains are very erratic, 
in relative yield, and some are very consistent. Strains highest with respect to 
yield are not necessarily good with respect to kernel weight and other desirable 
properties so over-all ratings involve weighting considerations on various character- 
istics. 

It is clear from dominance considerations that some strains should seldom or 
never be grown. in other cases mixtures of strains thay be better than any one 
strain. 

A determination of the odds for the different strains depends on the distribution 
of nature’s strategies. 


604 AUGUSTUS C. JOHNSON (Booz-Allen Applied Research, Inc.). A Stochastic 
Model of Incubation-Period Distributions. 


An independent-action birth-death model of infection is presented, based on the 
hypothesis that each organism has probability \ dt of dividing and » dt of dying 
during the time interval dt. \ and yu being constants. Incubation terminates when 
a colony reaches a lower absorbing barrier at o or an upper absorbing barrier at 
some number N, at which point a sign or symptom appears. 

The resulting differential-difference system is solved to give expressions for 
the distribution of conditional probability that n organisms will grow to an upper 
absorbing barrier N between times o and ¢. Expressions for the moments are derived 
both from the distribution and directly, being exhibited explicitly for moments 
through the third. 


CECIL L. KALLER and VIRGIL L. ANDERSON (Purdue University, 
695 Lafayette, Ind.). An Environmental Extension of Chromosome Analysis in 
Population Genetics. 


A phase model has been developed for the special case of a diploid diallelic 
genetic population having K pairs of chromosomes in genetic equilibrium which are 
influenced by Q tri-level environmental factors. The theoretical structure for this 
development is based on the theory of factorial experimental designs with each 
of the K + Q factors having three “levels.” Estimates of the parameters of the 
population described by this model are obtained by extending the methods presented 
in a thesis by McKean (Purdue, 1958). This sampling method involves the formula- 
tion of a “chromosome population” by the random selection of a set of K chromo- 
some pairs from the original genetic population. The chromosome pairs so drawn 
are combined to form all possible genotypes; individuals of each resulting genotype 
are placed in all 3@ possible environmental level combinations in numbers propor- 
tional to their theoretical frequency. Under the assumption that all interactions 
involving three or more factors are sero, an analysis scheme is developed for the 
estimation of chromosome population parameters. Therefrom, by the relationship 
demonstrated between the chromosome population parameters and the corre- 
sponding parameters of the original genetic population, conditions are established 
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MARVIN A. KASTENBAUM (Oak Ridge National Laboratories, Oak Ridge, 
Tenn.). Countercurrent Dialysis—A Stochastic Process. 


Peptides isolated from rat liver microsomes may be separated from contami- 
nating amino acids, sugars and salts by countercurrent dialysis. A single such 
dialysis can achieve only fractional purification. Therefore, more complex systems 
of multiple dialysis, in which a linear series of cells are used, have been devised. 
The system of immediate interest may be described as follows: Let each dialysis 
cell be a stage, and each dialysis period be a cycle. Then after each cycle, the 
dialout (the solution outside the dialysis sac), at each stage is concentrated and 
used as the dialin (the solution inside the dialysis sac) in the succeeding stage, 
whereas the dialin at each stage is concentrated and returned to the dialysis sac of 
the previous stage. The dialin at the first stage is retained at the first stage, and 
the dialout at the final stage is taken out of the system. In the nomenclature of 
stochastic processes, the first stage represents a reflecting barrier and the last stage 
an absorbing barrier. 

To determine the probability with which a particle of the isolate will be at a 
specified stage in the system after a given number of cycles have been carried out, 
the system is described in terms of a. Markov process with a stochastic matrix re- 
sulting from the product of two matrices, namely, a diffusion matrix and a transfer 


matrix, and algebraically explicit solutions are derived for any number of stages 
and cycles. 


JOSEPH KEILIN (U.S. Dept. of Health, Education, and Welfare, Washing- 
697 ton, D.C.). The Use of the Information Statistic as a Measure of Conformity 
in Comparing Two Sets of Responses. 


A morbidity survey of the city of Nashville and suburbs was recently conducted 
by the Air Pollution Medical Program. In order to obtain some measure of the 
reliability of responses a subsample was chosen for reinterview. 

Two methods of looking at the data are discussed and two functions are pro- 
posed for measuring the proportion of information transmitted from the ordinal 
interview. The two methods dictate different null hypotheses and these in turn 
lead to the two different functions. 

The results from the reinterview are presented and show relativeiy low amounts 
of information transmission. Further analysis of the data, using information theory 
procedures, is suggested. 


698 JOHN G. KEMENY (Dartmouth College, Hanover, N. H.). General Remarks 
on Markov Chains with Illustrated Examples in Biology. 


Part I of the paper deals with finite Markov chains. It is shown how, through 
intelligent classification of various types of chains and the use of modern matrix 
techniques, many fundamental problems can be reduced to routine computations. 
Two fundamental matrices are developed, one for absorbing and one for ergodic 
chains, in terms of which basic quantities can be simply expressed. As applications 
of these mathematical techniques, problems in genetics are considered. 

Part II of the paper deals with some recent joint research carried on with 
J. L. Snell, concerning denumerable Markov chains. It is shown that extensions of 
the above-mentioned matrix techniques to denumerable chains result in an interest- 
ing discrete analogue of classical potential theory. While the discrete version is 
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simpler than the classical potential theory, it can be vastly generalized by extending 
it to a wide class of denumerable Markov chains. This has been done previously 
for transient chains, but the extension to recurrent chains is new. The fundamental 
matrices extend to basic potential operators. Applications of these results are 
sketched to population problems and to the spread of epidemics. 

Throughout the paper, stress is laid on the methodological advantages of setting 
up a Markov chain model. A number of key results are stated without proof and 
several unsolved problems are mentioned. 


OSCAR KEMPTHORNE (Iowa State University, Ames, Iowa), and R. N. 
699 CURNOW (University of Aberdeen, Aberdeen, Scotland). The Partial 
Diallel Cross. 


The diallel cross, which is composed of all possible single crosses among a group 
of inbred lines, is a common plan of investigation in plant and animal breeding. 
It can be used to estimate general and specific combining abilities and variances, 
to estimate components of genotypic variation and to estimate yielding capacities 
of multi-way crosses. With n lines it does however involve n(n — 1)/2 crosses, 
even if maternal effects can be ignored. Breeding programs can easily lead to 50 
inbred lines which merit examination, so the diallel cross becomes unfeasible. This 
paper deals with sampling of the complete diallel cross table in a specific say, namely : 


(1) arrange the lines in random order 
(2) obtain the crosses of line 7 with lines 


where k = (n + 1 — s)/2 is an interger and all number above n are reduced 
by a multiple of n so as to be between 1 and n. 


Each line is crossed then with s lines and s = n — 1 corresponds to the complete 
diallel cross. This partial diallel cross is considered in the following respects: 


(1) estimation of variance components 
(2) comparing the yielding capacities of all possible crosses and . 
(3) estimation of general combining abilities, 


The efficiency of the plan with regard to each of these aspects is evaluated and 
discussed. It is shown to be more efficient than some other possible plans under 
some circumstances. 

BERTRAM S. KRAUS (University of Washington, Seattle, Wash.). The 


_ Study of Human Growth: Some New Horizons. 


Researchers in biometrics, genetics, and physical anthropology have a fertile 
field for miscegenation in the study of growth. An example is the timely and signifi- 
cant problem of the possible genetic effects of irradiation on future human genera- 
tions. Thus far workers in this field have failed to demonstrate that radiation 
causes mutations in the human species, although this can be assumed from experi- 
ments with other mammals. Perhaps this failure can be associated with insufficient 
knowledge or recognition of the nature of growth and the role of the genetic consti- 
tution in controlling or regulating growth. Perhaps, too, there has been too much 
emphasis placed upon clinical classifications of abnormalities (congential malforma- 
tions) and not enough on the less dramatic but none the less real deviations in 
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patterns and rates of growth. Equally significant is ile failure to define abnormality 
in terms of the statistical characteristics ot the given population. 

Attention is focused in this paper upon the distribution of certain skeletal 
phenomena in prenatal life—sequence of appearance of centers of ossification in 
the foot, changing long bone inter-relationships, and appearance of centers of cal- 
cification in the primary dentition. Norms for human populations are estimated for 
these traits. It is then suggested that significant deviations from norms established 
ia control populations may reflect higher frequencies of mutant genes in a test 
population. A method is proposed for the study of the genetic effects of irradiation 
in Hiroshima and Nagasaki wherein the biometrist, the geneticist, ind the physical 
anthropologist may pool their skills and concepts. 


B. KURKJIAN (Diamond Ordnance Fuse Laboratory) and M. ZELEN 
701 (National Bureau of Standards, Washington, D. C.). A General Theory for 
Analysis of Asymmetrical, Confounded Factorial Experiments. 


A general theory is developed for the analysis of factorial experiments involving 
an arbitrary number of factors and levels of each. The treatment-combinations are 
assumed to be assigned to an appropriate incomplete block design and the treatment 
estimates, #’s, are computed in the usual way, ignoring the factorial correspondence 
between the treatments. New results are presented for obtaining the best linear 
estimates (and the associated var-cov matrix) of the various main effect and inter- 
action parameters in terms of the #’s. A condition on the var-cov matrix of treat- 
ment estimates, V, , is provided which insures that the sum of squares of the various 
main effect and interaction terms is distributed as chi-square. Furthermore, a 
certain large class of partially balanced incomplete block (PHIB) designs with 
arbitary number of associate classes is described which can be treated by methods 
of this paper. The theory is applied to two PHIB designs involving three associate 
classes. 


702 ROBERT S. LEDLEY (George Washington University, Washington, D. C.). 
A Sequential Decision Theory Applied to Medical Diagnosis. 


The reasoning foundations of medical diagnosis involve two well-known mathe- 
matical disciplines; symbolic logic and probability theory. Value theory then can 
aid the choice of an optimum treatment. 

Three factors are involved in the logical analysis: medical knowledge relating 
disease and symptom complexes; the symptom complex of the patient; and the 
disease complexes that are the final diagnosis. Each of these can be expressed as a 
Boolean function. The ease of making diagnostic tests varies greatly; a realistic 
method of diagnosis uses easier tests to select among more difficult tests. 

Using probability theory, a ‘niost likely’ diagnosis is determined by applying 
Bayes’s formula to the conditional probability that a disease complex will produce 
& particular symptom complex and the total probability that any person chosen 
from the sample under consideration will have that disease complex. Certain 
problems arise in computing statistics for the probability determinations; the 
difficulty in obtaining ‘sufficiently large’ data and the time delay in gathering data. 

Value concepts in medical diagnosis and treatment are concerned, for example, 
with decisions on whether to continue further testing, or what are the values of 
particular alternative treatment—diagnosis combinations. We distinguish three 
kinds of problems in treatment decision; under certainty, i.e. when the disease is 
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definitely known; under risk, i.e. when alternative diagnoses have been made with 
known probabilities; under uncertainty, i.e. when alternative diagnoses remain 
with no information concerning the probabilities. 


703 K. H. LU (Utah State University, Logan, Utah). The Means and Variances 
of the Products of Two or Three Normal Variables. 


Given normal variables ¢; , ¢2 and ¢; with means y: , 2 and ws, variances o11, 
o22 and o33 respectively. Let ¢: and ¢: follow the bivariate normal distribution with 
moment generating function: 


M(t,t.) = exp. uit, +3 


Also let ¢; , ¢2 , and ¢; follow the trivariate normal distribution with moment gener- 
ating function: 


3 3 
M(t,t.t;) = exp. > uit; + 4 7. 
i=1 tj=1 
We define new variables z and y as follows: 
z= tt,, and y = respectively. 


It can be shown that by differentiating M(¢,t2) and M(titsts) to obtain the appro- 
priate partial derivatives evaluated at 4; = 0 and substituting into the definitions 
of means and variances, we have: 


= Milla + O12, 

= + + + + pie), 

My = + + + 

Soy = + + + + 
+ + pis) + pos) 
+ + 2pi2 2pis + + 8p12013623) 
+ + 4012033) + + 4013022) 
+ + 402301:). 


H. M. C. LUYKX and BETTY L. MURRAY (office of the Surgeon General 
U.S.A.F.). Durations of Iliness. 


Using epidemiology in the broad sense of the word, one of the epidemiological 
characteristics of a disease is its duration pattern. Not only the mean duration 
will vary from one diagnostic entity to the next, but so will the shape of the distri- 
bution of durations, which can be described in several ways. In this paper a picture 
is presented for each of a number of specific diseases. For example, graphs are 
shown for 368 cases of infectious hepatitis, 446 cases of mumps, 879 cases of pilonidal 
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eyst and 2177 cases of strevivcvceai sore (uroat. These and viher charts illustrate 
the techniques of analysis aud presentation, und the use maue ©: vic data. 

The curves are ‘‘cases—remaining” curves, whieh are strictly analogous to 
survivorship curves commonly given with life tables. A e:!.ort begins its episode 
of illness at the zero point on a scale of days, and diminishes as each case terminates. 
The cases—remaining curves are more useful than ordinary frequency polygons i 
(on a scale of duration). They show at a glance the range and skewness of the . 
duration distribution, the modal day for return to duty, and the median and other 
percentile points on the “days-atter-admission” scale. 

The Air Force medical data are unique in that they provide unusually large 
irequencies, derived from uniform records and recording procedures, showing di- 
agnoses established by a physician in each case. In this presentation, the beginning 
of an illness is excusal from duty by a physician, and the end is return to military 
duty; cases not returned to duty (deaths or disability separations from the service) 
are not included. 


Definitions of illness, diagnostic entity, disabiliiv, admission, return to duty, i 
etc., are also discussed, ang a technical note deseribes the incthod of computing 3 
confidence intervals for percentile points. 


705 CLIFFORD J. MALONEY (U.S. Army Chemical Corps, Ft. Dietrick, Md.). 
Disease Severity Quantitation, II. 


It was previously shown (Disease Severity Quantitation. Paper read at 
Washington, D. C. 1959 Annual Meeting of the American Statistical Association) 
that symptoms of rats exposed to aerosol challenge of B tularense conform closely 
to a scaling model due to Guttman (Stouffer, S. A., Measurement and Prediction, 
1950, Princeton U. Press). 

Necessary and sufficient criteria of scalability are that symptom pairs yield 
four-fold contingency tables in which one, but not both, of the secondary diagonal 
cells are zero, and that the zero cells in all tables can be arranged in a linear order 
in either symptom designator. 

The present paper points out that (in the absence of experimental accidents) 
the sum of the four cells in every table is fixed, that each table divides the entire 
disease severity scale into three intervals. In the first, the disease severity is too 
slight for either symptom to appear. In the second, the more sensitive symptom 
appears, but not the other, and in the last segment, the disease severity is such 
that both symptoms appear. The lower boundary is fixed by the milder symptom oF 
and is located at the same severity level whatever the other member of the pair; i oe 
similarly for the upper level. Hence, a system of linear equaticns can be formed 
in the segment frequencies and in the segment lengths. 

These sets can be equated and expressed in matrix notation. If the experimental 
data is perfect, the matrix will be triangular. If the data are subject to random 
error, the further treatment can be carried out in terms of a matrix subject to random 
error. 


H. E. MCKEAN and B. B. BOHREN (Population Genetics Institute, Purdue | et 
706 University, Lafayette, Indiana. U. S. A.). Numerical Aspects of the Regres- | = 
sion of Parent on Offspring. tr 


The relative merits of three methods of estimating heritability from the 
regression of offspring on one parent’s record, from the point of view of statistical 
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efficiency, are brought out with respect to five generations of a closed poultry flock 
previously reported by Yamada, Bohren, and Crittenden (Poultry Science, 1957). 
The three methods discussed are: (i) The regression of offspring means on parent 
records. (2) The regression of offspring records on parent records, in which the 
parents’ record is repeated once for each progeny. (3) The Kempthorne—Tandon 
weighted technique (Biometrics, 1953), in which account is taken of the correlation 
coefficient » between errors associated with progeny of the same parent. 

The latter technique (3) depends upon knowing p exactly; in this case the 
estimation technique is optimum. If p is unknown, p must be estimated or guessed; 
in which case (3) is not optimum. The loss in efficiency in misguessing p is discussed, 
over. a range of possible values of p. A comparison is simultaneously made with 
methods (1) and (2). . 

The general conclusions, based on these data are: (i) method 1 is distinctly 
inferior to methods 2 and 3 in most conceivable situations; (ii) if p is small (which 
it is if the genetic variance is primarily additive and maternal effects are minor), 
then method (3) is only slightly more efficient than method (2). Indeed, method 
(2) is simply a special case of method (3) with p always guessed to be 0. 


707 HARLEY B. MESSINGER (University of California, Berkeley, Cal.). A 
Geometrical Model for Height-Weight Control Charts in Schoolchildren. 


The primary objective of this study was to obtain a satisfactory screening 
device, i.e., a way to give a yes-or-no answer at any time tothe question, ‘‘Are this 
child’s height and weight within the ranges one would expect knowing his age, sex, 
and previous heights and weights,’’ To accomplish this, a set of tolerance limits 
were derived from the combination of cross-sectional studies of two functions of 
height and weight in one healthy group of children and longitudinal studies of 
these functions in each individual of a second healthy group with respect to the 
cross-sectional standards set by the first group. Because the functions were approxi- 
mately independent, separate control charts could be used. 

A secondary objective of the study was to seek interpretability for the control 
charts. In an interpretable chart, a child’s ratings at any time can be related di- 
rectly to his physical characteristics. A class of geometrically-meaningful functions 
arising from the use of a cylinder to approximate the human body was examined 
in the Institute of Child Welfare “(Guidance Study” data. The height (H) of the 
cylinder and the diameter of the cylinder relative to the height (D/H) were es- 
sentially uncorrelated within each age group from 4 to 14 years. Individual children 
of the “Berkeley Growth Study’ were studied longitudinally. Their H and D/Y 
functions at each age were studentized to obtain standard- or z-scores. From 
studies of these sequences as stationary autoregressive time series, reasonably 
efficient control charts were obtained. The D/H function, called “relative broad- 
ness,” can be gotten from a nomogram. No arithmetic computations or standard- 
score tables are needed for this kind of chart. 


708 CHARLES J. MODE (Montana State College, Bozeman, Mont.). On The 
Theory of the Improvement of Metric Traits In Outbreeding Populations. 


An attempt was made in this paper to characterize the change in the mean 
and genetic variance of a metric trait under a program of improvement. The 
approach used in this paper differs from other approaches by carefully distinguishing 
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between the fitness of a genotype under a program of improvement and the metric 
trait. Only the single locus case with an arbitrary number of alleles was considered, 
but the model may be interpreted in such a way as to include the multi-locus case. 

Steady state populations.and the problem of stability of a steady state were 
also considered. The existence of steady states and the question of their stability 
was discussed for constant and non-constant measure of fitness. 

The conditions for the existence of an optimal value for the mean of a metric 
trait when two or more alleles are maintained in a population under a program of 
improvement were also given. 


W. S. OVERTON and A. L. FINKNER (North Carolina State College, 
709 Raleigh, N. C.). The Sample Road-Block Method of Estimating Hunting 
Pressure. 


The sampling method described was developed for use on Game Management 
Areas in Florida, but is applicable in situations where estimates of hunting pressure 
are desired for areas having few or no inhabitants. The method involves the sampl- 
ing of access points. e 

Under a given definition, it is possible to express “‘hunting pressure” in terms 
of total persons entering and leaving the area by time period. An expression is 
developed for the definition used in Florida, which forms the structural basis for 
sampling. Total hunting pressure is divided into a daily component and a com- 
ponent due to camping; only the former is considered in this paper. 

The daily component of hunting pressure is a simple linear combination of the 
time period totals of entries and exits. There are a number of sampling systems 
that will give unbiased estimates of the totals by time periods, and, hence, unbiased 
estimates of the daily component. A system in which an observer measures & 
randomly selected station at alternate time periods with the starting time randomized 
among days was chosen because of its simplicity and ease of operation. Time periods 
are considered strata, and the usual stratified sample estimates of totals and variances 
are employed. This system does not yield unbiased estimates of variance, but 
seems to be satisfactory for present purposes. 

Alternative sampling systems and corresponding estimates of variance are 
considered and evaluated. 


719 D- 8. ROBSON (Cornell University, Ithaca, N. Y.). Cumulant component 
analysis in balanced designs. = 


Cumulant components, as a direct extension of variance components, described 
the manner in which the usual assumptions on the linear model are violated. The 
unique, minimum variance unbaised estimators of cumulant components for the 
distribution-free, balanced model are easily determined because of the existence 
of a complete, sufficient order statistic. One computational procedure for con- 
structing these estimates is an algebraic extension of the analysis of variance to the 
“analysis of cumulants”’. 

Under the usual Model II assumptions of normality and independence of effects, 
the components of all cumulants beyond the second are zero. Tests of departure 
from sero, therefore, constitute tests of the usual assumptions. Several such tests 
are available. 
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71; 3B. V. SHAH (Iowa State University, Ames, Iowa). Asymmetrical Factorials 
in Incomplete Blocks. 


The primary purpose of this paper is to summarize the recent developménts in 
this subject and to indicate possible lines of attack for future research. After a 
brief summary on history of developments, the desirable properties of factorial 
experiments, namely orthogonality and balance are discussed. The analysis and 
the construction of the designs with above properties are studied. As a consequence 
of relaxing the condition on balance we obtain a wider class of designs, which may 
be called partiaily balanced factorial experiments. Finally, the use of pseudo- 
factors and some other aspects of these designs are studied. 


712 R- G. D. STEEL (Cornell University, Ithaca, N. Y.). A Veriable Rank 
Sum Multiple Comparisons Test. 


A variable rank sum is used for the comparison of all pairs of treatments. This 
sequential method of testing should be more powerful than a procedure using a 
fixed rank sum and an experimentwise error rate. Computation of exact probabilities 
and appropriate tables is discussed. 


713 - PATRICK SUPPES (Stanford University, Stanford, Cal.). Foundations 
of Measurement. 


This paper is concerned with the axiomatic foundations of measurement. The 
procedures and operations of measurement can be represented as a relation algebra 
in the sense of Tarski. The two formal theorems which demonstrate the adequacy 
of a particular scheme of measurement are the representation theorem which estab- 
lishes that the algebra of empirical relations can be mapped homomorphically into 
the appropriate structure of the real numbers, and the uniqueness theorem which 
establishes that the homorphism is unique up to the appropriate group of transfor- 
mations. Some empirical examples of these ideas are given and some problems of 
axiomatizability are mentioned. 


WILLIAM F. TAYLOR (University of California, Berkeley, Cal.) and 
714 JOSEPH BERKSON (Mayo Clinic, Rochester, Minn.). A Problem of Testing 
and Estimation Involving Four Fold Tables. 


In this paper, the old problem of the analysis of four fold tables is re-examined 
with respect to a peculiar kind of sampling. Assuming a population possesses 
properties A and B, consideration is given to several ways in which samples might 
be taken in order to determine whether A and B are associated. 

Case 1. Simple random sampling. N members of the population are drawn at 
random and the frequencies of occurrence of A, B; A, not B; not A, 8; and not A, 
not B are observed. This is the classical, simple sampling procedure. 

Case 2. Sampling with respect to A. By this is meant the random and inde- 
pendent selection of N,; members with property A and N; = N — N;, members 
without property A. N; and N; are fixed in advance and the number of cases of 
B and not B are observed in each of these two groups. 

Case 3 is the reverse of case 2, namely, sampling with respect to B. 

In case 4, estimates of various parameters are found by one of Neyman’s methods 
for obtaining best asymptotically normal estimates. A more recent method involving 
a logistic transformation ig,also used. The powers of several related x* tests are 
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compared with the result that tests based on sampling by case 2 or 3 above have 
higher asymptotic power than one based on case 1 sampling. Case 4 lies between 
case 2 and case 3 in this regard. 


ROBERT M. THORNER and QUENTIN R. REMEIN (U. S. Public 
715 Health Service). Some Aspects of Screening Tests for the Detection of 
Disease Suspects. 


The screening of large population groups for chronic disease in a growing health 
department activity. The purpose of screening is the selection, for diagnostic 
evaluation, of persons in the population with a high probability of being diseased. 
The application of screening tests requires a basic knowledge of the attributes of 
these tests and an understanding of the results that may be anticipated when a 
particular test is applied to a population group at a selected screening level. 

Tests used in screening must be simple and inexpensive; should be sensitive, 
specific, accurate, and precise. The theory of overlapping distributions is used to 
explain sensitivity and specificity. The stability of these measures and their relation- 
ship to prevalence is discyssed. The effect of altered screening levels on sensitivity 
and specificity is illustrated. 

Problems of selecting screening levels are discussed, together with the 
relationship of sensitivity, specificity and prevalence to the frequency of false nega- 
tive and positive test results. Youden’s test evaluation index is discussed in relation 
to comparing test results with chance. 

The static and stochastic models of screening tests are considered, and “‘relative”’ 
sensitivity and specificity are reviewed. The calculation of confidence limits for 
sensitivity and specificity percentages is illustrated, and statistical techniques used 
for comparing results of different screening techniques, for correlated and uncorre- 
lated models, are reviewed and illustrated. ‘Precision’ and ‘‘accuracy”’ are defined, 
and the measurement of precision and comparison of precision measurements is 
illustrated. 


716 R. M. THRALL (University of Michigan, Willow Run, Mich.). A Review 
of Mathematical Aspects of the Theory of Measurement. 


The past fifteen years have witnessed a vast advance in the techniques and 
theories of measurement. Methods and theories appropriate to the physical sciences 
have been extensively generalized so as to become more applicable to the behavioral 
and life sciences. The present paper is limited to a review of certain mathematical 
systems which either have been used in measurement or show promise of such use. 

Measurement in the physical sciences centered on description of some attribute 
of a physical object by a real number. S. S. Stevens pioneered in generalizing this 
concept of measurement, and his work stimulated still further generalization and 
abstraction. 

A fundamental point of view in modern measurement theory is that the mathe- 
matical system used for measurement should be no stronger than is justified by 
the properties of the-objects being measured. For example, use of the real numbers 
to measure hardness is not justified since neither addition nor multiplication has 
meaning relative to hardness. All that is physically meaningful is that relative to 
hardness objects can be placed in order with ties permitted. Thus for the ‘‘degrees 
of hardness” only a chain order is mathematically justified. The presence of ties 
in hardness indicates that a weak order is suitable for the objects themselves. 
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The present paper is devoted to a review of the development during the past 
fifteen years of mathematical models appropriate for measurement. These models 
vary in the strengths of the axiom systems used and in the extent to which stochastic 
theories are employed. Most of the early systems are entirely deterministic; later 
systems are increasingly stochastic in nature. 


M. B. WILK and R. GNANADESIKAN (Bell Telephone Laboratories, 
717 Murray Hill, N. J.). Some Remarks on Plotting Procedures in the Analysis 
of Experiments. 


The paper deals with certain problems of estimation, associated with plotting 
procedures in analyzing experiments, using rank order statistics. A detailed discus- 
sion of the half-normal plotting procedure in this connection is included. 


The following are abstracts of papers presented at The Symposium on Quantitative 
Methods in Pharmacology held in Leyden, The Netherlands, May 10-18, 1960. 


71g J- HAJNAL (London School of Economics, London, England). Sequential 
Tests on Analgesics in Rheumatoid Arthritis. 


The paper is concerned with a series of clinical trials carried out in the 
Rheumatism Research Clinic of the Royal Infirmary, Manchester. The background 
and trial procedure are briefly described. The statistical analysis appropriate to 
this procedure, if sequential methods are not used, is compared with the sequential 
method in fact employed. The advantages and disadvantages of incorporating 
this sequential method in the design are discussed and some possible improvements 
in the sequential analysis are indicated. 


719 N. L. JOHNSON (University College, London, England). On the Choice 
of a Sequential Test Procedure. 


1. In the construction of a standard sequential probability ratio test (s.p.r.t.) 
procedure we need to define (i) two pivotal hypotheses Ho , H: , (ii) two (approxi- 
mate) probabilities of error ao , a: , such that 


Pr {reject H; | H;} = a; G = 1,2). 


It is known that the s.p.r.t. is then optimal in the sense that among all procedures 
satisfying (ii) the s.p.r.t. has (nearly) the smallest expected size of sample whether 
either or H is true. 

2. Conditions (i) and (ii) are the same as those often used in determining the 
size of sample to be taken in a classical fixed sample procedure. In such cases the — 
hypotheses H» and H;, (and the probabilities a» and a:) are chosen in the light of 
the required sensitivity of the procedure. For example H» may be a ‘null hypothesis’ 
(and ao a ‘level of significance’) corresponding to a desirable level of quality, with 
H, representing a change in quality of such an amount that a high probability 
(1—a:;) of detecting it is needed. 

If Hy and H, , determined in this way, are used to define a s.p.r.t., the full 
advantage of the sequential method may be lost unless either Ho or H; is true 
(or nearly so) in a large proportion of cases. When a hypothesis H, different from 
both Ho and H, , is true there is no guarantee that the s.p.r.t. procedure defined 
by He, Hi, ao, a: will even have a lower average sample size than a fixed sample 
procedure defined in the same way. 
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3. As.p.r.t. satisfying (ii) need not necessarily be based on Ho and H, as pivotal 
hypothesis. It is suggested that consideration should be given to the introduction 
of additional hypotheses Hj , H{ , to be used as pivotal hypotheses. Hj and H{ 
would be chosen to represent the sort of conditions expected to be encountered 
rather frequently—so that the saving in average sample size should be as effective 
as possible. The associated (approximate) probabilities of error af , a{ (where 
Pr {reject H}/H;} = af ;j = 1, 2) are then determined by conditions (ii). 

4. In this paper detailed consideration is given to the case when the s.p.r.t. is 
regarded as a test of the ‘null hypothesis’ Hy , and where we take H} = Hy. Some 
tables are given to aid in the construction of the test procedures, and also to describe 
the properties of such tests. 

5. Incidentally to the main topics, some results are obtained on the conparison 
of operating characteristics (or power functions) of comparable s.p.r.t.’s. 


729 CHR. L. RUMEE (Free University of Amsterdam, Neth.). An Efficient 
Design for Comparing the Effects of Two Treatments. 


The variability of the reactions of groups of animals to drugs often interferes 
with an efficient comparison of the effects of two treatments with all or none re- 
sponses, In classical EDse-determinations it is difficult to choose such dose-levels 
that the inefficient 0- or 100% effects are avoided. A design is to be described which 
leads to an efficient test for the difference between two treatments at an optimal 
dose level. A rough estimate of the ED,o’s can be made by slightly modifying this 
design—though with some loss of efficiency. 


721 K. L. SMITH (Boots Pure Drugs, Nottingham, England). Sequential 
Analysis Applied to Biological Control Tests for Pharmacopoeial Substances. 


During some biological examinations to control the quality of pharmacopoeial 
‘ysubstances, sufficient evidence may be obtained at one time to satisfy the official 
requirements and the results from such examinations may be analysed by the tradi- 
tional methods. 

In some examinations, such as those to exclude undue amounts of pyrogens, 
the evidence collected at one time may be sufficient to accept or reject the material 
in the extreme cases. For intermediate cases repeat tests are permitted and the 
criteria for them may be conveniently derived from a sequential sampling plan. 

In a large number of tests the evidence must be collected sequentially either 
as single responses as in the assay of Digitalis using guinea pigs or pigeons, or as 
the results from repeated tests which individually are too imprecise as in the assay 
of Insulin. Both circumstances lend themselves to the application of sequential 
analysis. In both cases where one has set out to make a certain number of observa- 
tions before applying traditional methods of analysis, the value of using a sequential 
plan may be one of economy only. In those cases where testing is continued until 
the sample may be considered satisfactory, it is essential that the amount of repeat 
testing should be sufficient to satisfy a sequential sampling plan. 


722 D-4J. MEWISSEN and E. H. BETZ (Universit. de Liége, Belgium). Se- 
quential Tests in Protective Effects of Cystamine and Chlorpromazine. 


A simple method based on sequential test has been used by Kimpa.t and al. 
(Radiation Research, 7, 1, 1957) for screening compounds in radiation protection. 
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Under specific conditions, the method is easy to apply and the decision of accepting 
or rejecting a prospective action of protection is based on the difference in animals 
surviving in the treated and control groups. 

The effectiveness of protective agents widely varies with the delay between 
administration to animals and exposure. The responsible factors and the under- 
lying mechanisms are poorly understood. Therefore, we investigated variations in 
the protective action of cystamine and chlorpromazine versus delay in X-irra- 
diation of treated animals. Usual sampling methods would have been unpractical 
and expensive as existence of a protective effect was questionned under various 
time specifications. Sequential tests were used. A 60% survival in the treated 
group was assumed a meaningful increase over 50% survival in the control group. 
Maximum risks for both kinds of errors were set at 10%. Tests were arbitrarily 
stopped after five steps. Decision was usually reached after three steps. 

In a first series of experiments, groups of ten animals were used. In treatment 
groups, chlorpromazine (5 mg) was injected intraperitoneally to adult pure inbred 
rats (average body weight: 180 g), respectively 2, 3 and 4 hours prior to total body 
irradiation (750 r). In a second series of experiments, groups of 5 animals were 
used. Cystamine (100 mg/100 g of body weight) was injected intraperitoneally 
to rats, respectively 2, 10, 20, 30 and 45 minutes prior to irradiation (900 r). In 
each instance, a sequential test was performed on successive groups. In addition. 
mortality in all groups was recorded for 30 days and mortality curves were drawn. 
In all cases, decisions reached at an earlier stage with sequential tests were confirmed 
by further examination of the completed mortality curves. 


E. W. PELIKAN (Boston University, Boston, Mass., U.S. A.). Dose Meta- 
meters in Comparative Pharmacology. 


Both inter-species and intra-species comparisons of sensitivity to drugs of 
common mammals, including man, may be facilitated by expressing dose as a 
power function of body weight. Determination, from published data, of the con- 
stants in the equation [log mean total dose (mg) = 6 log mean body weight 
(gm) + a] yielded the power function best suited to inter-specific comparison of 
potency of drugs of three classes. When paralysis was the criterion of effect, b for 
tubocurarine, its dimethylether, benzoquinonium, gallamine, and succinylcholine 
did not differ significantly from 1.0; for decamethonium, b was 0.60, significantly 
less than 1.0. When anesthesia was the effect, b for amobarbital, pentobarbital and 
secobarbital was about 0.80 and b was independent of route of parenteral drug 
administration. When acute death was the effect, b was 0.83 for ouabain, 1.01 for 
digitalis tincture. In contrast, analogous values of b for digitoxin, digitalis, ouabain 
or Lanatoside C (computed using data for total lethal doses and body weights of 
individual members of a given species) varied from 0.52 to 0.89 (median: 0.70) for 
these drugs in each of several species. Minimizing inter-species variance in potencies 
of drugs, precision in estimating and predicting drug potencies among species or 
individuals, and detection of species differences in sensitivity to drugs may be 
furthered by use of an appropriate dose metameter. 


724 P- J. CLARINGBOLD and C. W. EMMENS (University of Sydney, 
Australia). Biological Assays Involving Quantal and Semi-quantal Responses. 


Various transforms have been used for interpreting the results of biological 
assays and other experiments when the response is quantal. Usually, intervals 
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are studied in which transforms such as the probit, logit or angular show no per- 
ceptible differences, and it ie a matter of practical convenience which is used. The 
angular transform is often the most useful, as it :as equal weights at all response 
levels, 

In much early work, few dose-response liaes had to be calculated, and the 
design of tests was simple. More and more frequertly however, screening trials or 
complex designs require methods which offer -apid analysis, even if approximate. 
Also, within-animal quantal assay will usually yield few responses from each animal 
and the application of even the angular transform becomes extremely complex. 
In such circumstances, analyses of variance of the untransformed data (scored 
0, 1 or 0, 1, --+ =) is defensible and gives rapidly computed estimates which show 
little loss of information and allowed of up to a 4-fold increase in precision in cross- 
over assays with mice. In factorial tests, probit or even angular transformation 
leads to formidable computations if many factors are involved, whereas analysis 
of variance of the untransformed data is simple and remarkably accurate in practice. 


725 R. VAN STRIK (Philips-Duphar, Weesp, Neth.). A Simpie Method of Esti- 
mating Relative Potency and its Precision from Semi-quantitative Responses. 


A simple procedure is descmbed for evaluating those bio-assay experiments 
where the observed responses are scored in a semi-quantitative scale, such as =, 
+, ++, +++. The initial responses are ranked together in order of magnitude 
and a potency ratio with its confidence limits are calculated from the ranks using 
customary bio-assay computational methods with just one minor alteration. The 
method is outlined in an example and its reliability is investigated. 


7126 CLAUS RERUP (The Royal University, Lund, Sweden). On the Validity 
of a Cross-over Assay. 


Cross-over assays are valid only, when the following criteria are fulfilled: 

1) The assay design has to be balanced, which means that a set of observations 
always has to be obtained not only in a certain order, but simultaneously in the 
reverse order because of possible changes in the magnitude of reactions between 
treatments within subjects. 

2) No significantly different changes in reaction between groups of subjects 
are allowed to occur from treatment to treatment. 

For the twin and triplet-croas-over assay (insulin, corticotrophin) it is shown 
that the hitherto applied validity tests (in the twin cross-over assay the test for 
parallelism of the dosage response lines for standard and test, in the triplet cross- 
over assay extended by a test for curvature of the mentioned lines) are not appropri- 
ate, because group differences may cause apparent significant differences in slope. 
Group differences are, however, eliminated in the cross-over analysis. An example 
of a twin cross-over assay on insulin is given, woich must be regarded as invalid 
according to hitherto published validity criteria, which, however, is a good and 
valid assay according to the essential validity criteria to be presented in the paper. 
727 J. HEMELRIJK (Technological Institute, Delft, Neth.). Experimental 

Comparison of Student’s and Wilcoxon’s Two Sample Tests. 


The choice between the many available statistical methods is often difficult to 
make in practical situations. It is often—too often—based on personal preference 
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without sufficient regard for the merits of the different methods available. A lack 
of knowledge about the small sample properties of many distribution-free methods 
tends to make their position uncertain in comparison to classical methods, based 
on specified suppositions about the sampled populations; the properties of the latter 
are known, if these suppositions are true. 

In a small experiment, using QUENOUILLE’s (Biometrika, 46 (1959), 178—204) 
50 independent pairs of samples of size 10 were used to compare STUDENT’s and 
Wicoxon’s two sample tests. 

The samples were first taken from a standard normal distribution (QuE- 
NOUILLE’s ;), then one of them was shifted over distances of 0.81; 1.05 ard 1.53 
to the right. These shifts correspond to a power for SrupENT’s test of 0.40; 0.60 
and 0.90 respectively. In all these positions both tests were applied onesidedly 
with level of significance 0.025. As was to be expected—SrupEnt’s test being 
uniformly most powerful in this situation—the power of WiLcoxon’s test was the 
smaller one. The results indicated that in this case one would not be far off the 
mark in stating that, roughly, the loss of power was about 10% of the power of 
StupEnt’s test as long as the latter is not too close to 1. (This also holds under the 
hypothesis tested: the true level of significance is for SrupENT’s test 0.025 and for 
WILcoxon’s test, owing to its discrete character, 0.022 in this case.) 

A second point of investigation was the reaction of the tests to an observational 
error in one of the original observations. For each pair of samples an observation 
was chosen at random from the sample with the larger median and this observation 
was shifted over a distance 2 to the right. In accordance with expectation StTUDENT’S 
test proved to be far more sensitive to this mild kind of slippage than W1icoxon’s 
test. 

Finally all observations, transformed to observations from a rather skew ex- 
ponential distribution (QUENOUILLY’s x«), were again ‘sted in the same way. The 
shifts were in this case replaced by multiplication of the observations of one of the 
samples from the starting point of the distribution, in order to keep this point 
invariant. The factors were: 1 (identical distributions); 2; 3 and 4.5, resulting 
in power function values of roughly 0.025 (the level of significance); 0.25; 0.45 and 
0.70 respectively. The two tests were now approximately equally powerful. 

All these results are dependent in a statistical sense, being based on the same 
1000 observations. Also the investigation was too small (although laborious) to 
warrant strong conclusions. It is only the beginning of more extensive research 
with more powerful means. 


H. DE JONGE (Netherlands Institute of Preventive Medicine, Leyden, Neth.). 
728 The Influence of Non-normality on the Significance Levels of Student’s Two 
Sample Test. 


The paper describes an empirical sampling study of the distribution of Student’s 
i for small samples taken from four populations with non-normal distributions, 
respectively showing: A. Marked skewness, B. Skewness with a bunch of outliers, 
C. Leptokurtosis with skewness, D. Platykurtosis with skewness. These types 
of distributions were chosen, because they were frequently seen in practice. 

From each population pairs of random samples of 5, 10 and 20 elements were 
drawn and the values of t were computed. For each of the populations and for 
each sample size the empirical sampling distribution of ¢ has been compared with 
the corresponding Student-distribution. 
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729 H. R. VAN DER VAART (University of J.eyden, Neth.). On the Robustness 
of Wilcoxon’s Two Sample Test. 


It has long been known that the probability of Type I erior of Student’s two 
sample test is subject to appreciable variation in the case of unequa] population 
variances (Hsu, 1938, Stat. Res. Memcirs; Chand, 1950, Ann. Math. Stat.). There- 
fore it is interesting to know the behaviour of Wilcoxon’s two sample test in this 
respect. Van der Vaart (1950), {ndag. Math.) found in a special case for very small 
samples that Wilcoxon’s test is iess sensitive to inequality of population variances 
than is Student’s tests, Kruskal and Waliis (1952, Journ. Amer. Sta.. Assoc., put 
forward an argument. which lends plausibility to the conjecture that Wilcoxon’s 
test may he fairly insensitive to differences ‘n variability, and Hodges and Lehmann 
(1956, Ann. Math. Stat. p. 327-329) show that Wil-oxon’s test is often less powerfu! 
against contamination with a shift chaa is Student’s test. All this might ead one 
to expect that Wilcoxon’s test would 1n geu2ral be less sensitive to the above-~™en- 
tioned inequality of variances than is Sivdeut’s test. Now this turns out not .- 52 
uniformly true. For instance, in the extreine case where one of the population 
variances is zero. the relation betweeu the samyie sizes determines which test wil! 
behave worst in thie respect. In the gove:a -s the situation is rather complicated. 


I. JUVANCZ (Math. Inst.tute, Hmgaran Acactemy of Science, Budapes}. 
730 Hungary). Contra Indication Ne~-psrametric ests in Medical Experi. 
mentation. 


According to the anthor’s personal experieuce ‘rom =e des. ¢ 4/or analysis 
of over 700 experiments (most of them medice!), the statisti-ians gaged in 
experiments show 4 reluctance to use non-parametric ~eti. *s. thoug.. numei. 
theorists enthusiastically advocate them. A  tatistical 1 imad. ~f papers 
from some medical, biometrical and statist'cal periodica’s, publishea in various 
countries. The results show the same tenae.icy as the author has noted av home 

The reluctance of those “qualified :n medicine or, if not so qualified, at least 
well soaked in it’’ is caused by zome characteristic features of medi 1 experimer- 
tation, well known to them. So eg. in medical experiments wu terest is mainly 
centered on the estimation of mear values and their fiducial |. .tc -nd on materiai 
significance. There is great varmation between and within individuais; the measure- 
ments are poor; the number of observations is small. Experin.ents 2re sometimes 
dangerous. Numerous factors disregarded in the statistical analysis influence the 
final decision. Experimental results are retested several times b; others before 
they are accepted. The results are ofter intended ‘or individual use (e.g. diagnostic 
test). Etc. etc. 

These characteristics compel the researchers to use statistical methods which 
yield results convertible to the original measure (e.g. kg, cm). Order statistics 
and the x?-tests are not of this kind. Further, to minimize the loss of information 
non-parametric methods are too wasteful. Consequently non-parametric tests are 
economically inefficient both qualitatively and quantitatively. 

Not rarely are non-parametric tests insensitive to changes regarded medically 
substantial and too sensitive to insubstantial ones. (E.g. x*-tests are insensitive to 
consistent though little increasements and sensitive to great though inconsistent 
ones; this is not the case with the ¢test.) If is often impossible to decide what 
caused the statistically significant change indicated by a non-parametric test (mean, 
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type of distribution etc.). On the other hand, tests of the mean are very “robust” 
against non-normality. 

Only s small field of application is left for non-parametric methods: the cases 
where the experimental techniques are too weak; e.g. when the results are only 
given in rank-numbers as sometimes in psychometrics. 

Consequently, the use of non-parametric methods should be avoided, if necessary 
by the invention of better experimental (e.g. chemical) methods. 


731 L- J. GOLDBERG (University of California, Berkeley, Cal., U.S. A.). A 
Visual Statistical Approach to Bio-assay. 


In quantal bio-assay one usually starts with the following information: 


‘A series of doses , d:, , d;) has been administered , Nx) 
test hosts respectively, and the corresponding reactors (ri, 72, °** , 7s) are observed 
at each test dose level’. 


Based on a set of such experimental observations, a visual procedure will be 
presented for estimating the true dose response. The proposed procedure is based 
on the concept that a single experimental observation should be considered as a 
triplet (r, N, d). The entire set of experimental observations may be considered 
as follows, in triplet notation. 


(ri, Ni, 1), (r2, de), , (rs, Ne, ds). 


The problem of estimation is reduced to the operation of ‘triplet addition’ 
conditional to an assumed dose response model. The rules for triplet addition 
will be illustrated in detail for the single hit dose response model. The extension 
to other dose response models will be considered, including the logit and probit 
as specific examples. 


732 CONSTANCE VAN EEDEN (Mathematical Cenire, Amsterdam, Neth.). 
On Distributionfree Bio-assay. 


In bio-assay the following situation is considered: a stimulus (e.g. a drug or a 
vitamin) is applied to a subject, resulting in a response produced by the subject. 
In this paper we only consider the quantal (all-or-nothing) response. Then for 
each subject there will be a level of intensity of the stimulus above which the re- 
sponse occurs and below which it does not occur; this level is cailed the tolerance 
of the subject. On a population of subjects this tolerance will be a random variable 
A, with distribution function F(A), say. The problem in bio-assay is to give an 
estimate of this distribution function F(A). A second problem is to compare the 
distribution functions F(A) and F(A) for two different stimuli. The observations 
then consist of the results of applying the stimulus once to each of a number of 
subjects, at several doses. 

The abovementioned problems are solved for the parametric case, e.g. when 
d has a normal distribution; the methods given for this situation are called “probit 
analysis” and are described e.g. by D. J. Finney (1947). 

The application of this method, however, requires in some cases laborious 
computational work. Moreover difficulties arise if, for one or more doses the number 
of observations is very small or if none of the subjects or all subjects give a response. 
Finally the assumption of normality may not be fullfilled. 

This paper contains a description of what may be obtained if no assumptions 
are made on the form of the distribution of 4. Let observations be available at 
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k different doses d; , , dy with d; < <d,. The number observations at 
dose d; is denoted by n; and the number of subjects giving a response at dose d; 
is denoted by a; . We suppose all observations to be independent. Let further p; 
denote the probability that a subject gives a response at dose d; then pi; = F(di); 
consequently, F(X) being a monotone non-decreasing function of A, we have 
Spe. 

For this situation maximum likelihood estimates of p; = F(d;) may be obtained 
by means of a method described in the thesis of C. van Expmn (1958). In this 
thesis a more general problem is considered, where k parameters of k distribution- 
functions are to be estimated, if it is known that these parameters are partially or 
completely ordered and moreover confined to given intervals J;. If the distributions 
are binomial, the ordering is complete and J; is the interval (0, 1], we obtain the 
above sumiened problem of estimating probabilities p. , --- , ps satisfying 
PiS-*> SDs. 

not necessary to have a large number of observations at each dose. Thus a maxi- 
mum likelihood estimate of F(A), at the doses used in the experiment, is obtained 
without any assumptions of a parametric nature about the distribution function 
F(A). 
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733 C. W. DUNNETT (University of Aberdeen, Scotland). Statistical Theory 
of Drug Screening. 


Screening a chemical compound for some specific form of chemotherapeutic 
activity is considered as a decision problem in which the terminal decisions open 
to the investigator are either to accept the compound as being worthy of further 
investigation or to reject it as being of no interest and the unknown “‘state of nature” 
is the degree of activity of the compound. A truncated sequential procedure for 
screening is proposed, in which rejection can occur at any stage but acceptance is 
allowed only at the final stage. The particular case in which only two levels of 
activity exist (called “active” and ‘‘inactive”) and all compounds have the same 
@ priori probability of being active is considered in detail. A criterion of optimality 
is introduced, which involves the cost of testing and the costs associated with wrong 
decisions. A method for computing the critical rejection levels when the testing 
errors are normally distributed is given, and illustrated for one, two and three- 
stage procedures. Extensions to cases in which some compounds are more likely 
to be active than others, and in which more than two levels of activity may occur, 
are discussed. 


‘M. A. SCHNEIDERMAN (Cancer Research Institute, Bethesda, Md., 
734 U.S.A.). Statistical Problems in the Search for Anti-cancer Drugs by the 
National Cancer Institute of the United States. 


The development of the external (contract) screening program of the Cancer 
Chemotherapy National Service Center is described. The basic experiment is a 
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joint toxicity-therapeutic trial. This permits the administration of maximum 
tolerated doses without the need for extensive prior toxicity testing. The basic 
screen uses three different mouse tumors with each tumor system examined in- 
dependently. A multi-stage sequential scheme is exployed; a two-stage scheme for 
“natural” products and a three stage scheme for synthetic chemicals. The operating 
characteristic curves for the two systems are compared. The acceptance and re- 
jection levels have been so adjusted that the two systems yield the same number 
of false positives, while the two-stage scheme yields more false negatives. This 
can be tolerated because of the nature of the natural products coming to the screen. 
The actual behavior of one of the screens is appraised by examining the responses 
to a “known positive’ compound, tested by different screeners in different places 
at different times. 

Problems of mechanical data processing, the use of additional screens beyond 
the basic three, secondary screening, optimal dose-finding, “positive” and ‘‘nega- 
tive” controls, possible deterioration of natural products, and some possible non- 
parametric approaches are considered, as examples of unsolved problems. 


J.J. GRIMSHAW and P. F. D’ARCY (Allen Hanburys, Ltd., Ware, England). 
735 Some Problems Arising in Drug Standardization and the Selective Screening 
of Compounds of Potential Pharmacological Interest. 


During the course of routine standardization and screening procedures a variety 
of problems have been encountered. Examples of in vivo and in vitro techniques 
are given, in which the assays of purgatives and succinylcholine together with the 
screening of potential analeptics and barbiturate potentiating drugs, antihistamines 
and neuromuscular blocking agents are considered. Solutions to some of the prob- 
lems are proposed and the need for careful statistical control throughout is em- 
phasised. It is suggested that, in some cases, insistence on statistical rigour may 
lead to results of diminishing practical importance, and the implications of this 
are discussed. 


NORBERT BROCK (Asta Werke, Brackwede, Germany) and BERTHOLD 
736 SCHNEIDER (Karlsruche, Germany). Pharmacological Characterization 
of Drugs by Means of the Therapeutic Index. 


For the pharmacological characterization of drugs it is not sufficient to describe 
their different qualities of action as such; in order to obtain a clear picture of the 
value and dangers of any substance, i.e. of its margin of safety, it is necessary to 
plot one of its actions against the other. As a measure of the margin of safety we 
introduced the therapeutic index (DL 5/DC 95 or DL 5/DE 95) which expresses 
the relation between the curative and toxic doses. Based on more than 10 years’ 
experience, the author discusses some practical examples in various fields of pharma- 
cotherapy. The trouble in applying the therapeutic index is the difficulty in ob- 
taining approximate values for the deviations of the index and in having suitable 
testing methods available. Both problems are discussed and sugggstions made 
for practical use. 


737 E. J. ARLENS (University of Nijmegen, Neth.). Analysis of the Action of 
Drugs and Drug Combinations. 


As a rule the determination of biological activities of drugs aims at the com- 
parison of the results obtained with various drugs or with one drug under different 
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circumstances, as for instance different doses. The experiments must therefore 
be based on a sound design and the experimental results evaluated with the aid 
of statistical methods. A primary question remains, however, whether the effects 
or magnitudes that are compared are really comparable in the sense meant by the 
investigator. This is especially of importance if the experimental results serve 
the study of relations between the chemical structure of drugs and their biological 
activity. Based on the result of such studies a more d‘rect design of new drugs, 
with a minimum of trial and error in the research procedure, might be possible. If 
experimental data are compared, au analysis of the biological effects studied is 
necessary in order to find out if, to what degree and in wnat sense these data are 
comparable. This will be demonstrated hy seme examples. . 


738 S. DIKSTEIN (Hadassah Medicai Schou!, Jerusalem, Israel). Physical 
Chemistry of Drug-receptor Interaction. i 


1) The effectivity in vitro of homologous series of aliphatic choline esters, 
straight chain aliphatic primary amines and straight chain aliphatic monoquater- 
ary ammonium compounds was determinea on the smal! intectine oi ,uinea pig. 

2) Morphine, an inhibitor of the acetylchoiine release, inhibits the contractions 
caused by small doses of nicotine, but nas no effect on the contraction caused by 
acetylcholine. This phenomena is saimiler to the mhibition caused by botulinum 
toxin, which is known to act at the nerve ending. Since morphine has r_ effect 


on the contractions caused by any of the homologous amines tested by us, it was . 


concluded that the primary amines and monoquaternary ammonium ions act 
directly on the smooth muscle. 

3) In the homologous series, the activity increases with increasing number 7f 
methylene groups up to a certain limit, the further addition of a single methyleue 
group then abclishes the contractive power or makes the compound inhibitory. 
Addition of more methylene groups first increases the inhibitory power and then 
decreases it. 

4) A qualitative explanation is offered to oxplain the cbserved facts. According 
to this hypothesis in the presence of a positive charge, if the solubility parameter 
of the compound is higher than that of the biophase we get excitation, i: the solubility 
parameter is lower we obtain inhibition. The smaller the difie~ence the greater the 
effectivity. On the molecular levei the excitation is oxplaineu ' y other workers 
by the interaction of materials with monolayers. According to this work it is 
postulated that if the material has a lower soiub‘lity parameter tha. that of the 
biophase it will dissolve in it increasing the spreading force. A material with higher 
solubility parameter will decrease the spreading force. 

5) Since the excitation is a kinetic phenomenon (the velocity of addition of 
the excitant determines the activity) whereas the solubility parameter represent 
in fact a term of energy, it is not difficult to explain the absence of quantitative 
predictions. 

6) The importance of the solubility parameter in other drugs is discussed. 


739 8S. E. DE JONG (University of Leyden, Neth.). IsuLoles. 
In this lecture reference is made to Loewe’s method, by which the paz.-a- 


cological effect of two drugs in combination is diagrammaticaily representet by » 


means of isoboles. 7 
A definition with a short description is given, followed by a survey of the most 
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important types. Some special cases are discussed more fully. It is explained that 
some forms of synergism do not lend themselves to being represented by means 
of isoboles (allobiotic synergism; inversion). Not only because of this do the isoboles 
not yield an optimal solution to the pertaining problem. It may be considered 
objectionable that it is necessary to estimate the exact amounts of a series of mixtures 
of two drugs needed for a quantitatively specified effect. Besides, the isobole thus 
obtained even then gives only incomplete information; for stronger or weaker effects 
an entirely different type of isobole can be found. 
In conclusion Loewe’s rather complicated nomenclature is enumerated. 


749 ©: 8S. HEWLETT and R. L. PLACKETT (Pest Infestation Laboratory, 
_ Slough, England). Models for Quantal Responses to Mixtures of Two Drugs. 


The various previous approaches to the construction of mathematical models 
for quantal responses to mixtures of drugs are briefly referred to, and it is pointed 
out that models for graded responses to mixtures cannot in general be transferred 
direct to the interpretation of quantal responses. A general method for developing 
models for quantal responses to mixtures is put forward. This is exemplified by 
development of a model for simple similar joint action, i.e. a situation in which 
two drugs have a common site of action, but in which neither modifies the behaviour 
of the other at their site of action or elsewhere. 


741 J. GURLAND (Iowa State University, Ames, Iowa, U.S. A.). Determination 
of Minute Insecticidal Residues through Biological Assay. 


Samples of alfalfa from a field sprayed uniformly with Guthion, were collected 
1 day, 3 days, 7 days, 2 weeks, 4 weeks after spraying. The residue in these samples 
was determined by means of a biological assay in which houseflies were exposed to 
increasing doses of the insecticide. Since sample extracts contained lipids and other 
materials which caused a masking effect, the standard preparations used for com- 
parison were made to contain comparable amounts of control extract. The usual 
type of parallel line assays employing mortality of houseflies were employed for 
determining the residues in the 1-day, 3-day, and 7-day samples, but the extracts 
for the later samples had to be fortified with known amounts of the Standard in 
order to raise the response from a very low level to a level at which a parallel line 
assay could again be employed. 

To increase the sensitivity of the technique it was found that the ratio of the 
volume containing the fortifying Standard to the volume of the Test extract should 
be made as amall as possible. Further, the final volume of Test extract should be 
small relative to the sample mass being extracted in order to increase the sensitivity 
expressed in such units as parts per million of sprayed alfalfa. 


742 A. G. MATHEWS (Dept. of Health, Victoria, Australia). The Interpretation 
of Antibiotic Blood Level Curves. 


Although hundreds of investigations of antibiotic blood levels have been reported, 
in comparatively few has any examination of the statistical significance of observed 
differences been performed, and in even fewer has any attempt been made to assess 
the pharmacological significance of the results. Consequently, the literature is 
filled with conflicting reports as to the relative efficacy of various dosage forms, 
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and the lack of adherence to any common experimental design makes the resolution 
of such conflicts very difficult. 

The problem of deriving a single statistic, with which to summarize the infor- 
mation contained in a set of blood level curves, is rendered difficult through ignor- 
ance of the relative importance of the height of the peak, the time at which the 
peak occurs, and the period during which the curve is above an arbitrarily chosen 
level. Attempts which have been made to derive such statistics will be reviewed, 
and their limitations discussed. 

The shape of the blood level curve in the region of the peak is usually difficult 
to determine, because of practical objections to the frequent collection of samples 
of blood. Advantages of experimental designs, in which only a few samples are 
required from each of many subjects, will be discussed. 

No matter how carefully sets of blood level curves may be determined, nor 
how ingeniously their properties may be summarized statistically, the problem 
of pharmacological interpretation remains. Stress will be laid on the importance 
of ancillary studies: for example, of the rate of excretion of the antibiotic, and 
of its distributiou in the tissues of the body. 


W. Z. BILLEWICZ (University of Aberdeen, Scotland). A Statistical In- 
743 vestigation of Factors Affecting the Accuracy of D,O Determinations by the 
Falling Drop Method. 


After a short description of the methcd the results of the analysis of the trial 
series are briefly discussed and taken as the basis of subsequent experiments. The 
experimental results cover various aspects of the method. The effects of changes 
in room temperature, position of the dropping tube and dropping technique are 
considered. Errors of the distillation stage are considered and discounted. The 
problem of the optimal selection of tube and drop size is considered in some detail 
in terms of an index of sensitivity of the method. Errors resulting from observer 
timing as opposed to photo-electric timing are discussed. A comparison of two 
main estimation metheds is made and finally the problem of serial estimates on tie 
same subject is considered. 


744 J. DUFRENOY (Conservatoire National des Arts et Métiers, Paris, France). 
Using the Arc Tangent Scale for Quantitative Methods. 


The arc tangent scale, used either for the abscissa or the ordinate scale, or for 
both, is most convenient for plottimg ‘‘responses’ as the dependant variable, . 
the range 0 to 100, against the independant variable, in the range 0 to infin: 

The arc tangent is most eminently suited for transfcrmation of the time ‘ 
into a metameter (7). 
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british Region 
The following Officers have been elected for 1961: 


President: J. A. Fraser Roberts 
Secretary: C. D. Kemp 
Treasurer: P. A. Young. 


The Summer Meeting of the Region was held on July 6, 1960, at the National 
Vegetable Research Station, Wellesbourne, Warwick. 


A meeting on October 27, 1960, was concerned with “Applications of Electronic 
Computers to Biological Problems.’’ The following papers were read and discussed: 


W. T. Williams—Some Applications of Electronic Computers in Plant Ecology 
and Taxonomy. 


J. N. R. Jeffers—Data Processing in Biological Research. 


F. Yates—Problems Arising in the Use of Electronic Computers in Statistical 
Analysis. 


Following the Annual General Meeting on December 13, 1960, the following 
papers were read and discussed: 


C. W. Emmens—The Planning and Analysis of Some Field Trials with Cattle. 
C. C. Spicer—Problems in the Analysis of a Large Scale Clinicial Trial. 


On January 2, 1961, a special meeting was held on “Biometrical Aspects of 
Plant Growth” in collaboration with the Society for Experimental Biology. The 
programme included: 


J. A. Nelder—Models and Experiments for Growth Analysis 


S.C. Pearce and C. S. Moore—A Study of the Sources of Variation in Growth 
of Fruit Trees. 


M. J. R. Healy—Experiments for Comparing Growth Curves. 
A general discussion was led by F. L. Milthorpe. 


E.N.A.R, 


The following officers have been elected for 1961 by the Eastern North American 
negion: 


President, 1961: Oscar Kempthorne 

Fresident-elect, 1961: Henry L. Lucas 

decretary, 1961: Erwin L. LeClerg 

‘breesurer, 1961: Donald A. Gardiner 

Kezioual Committee, 1961-63: Virgil L. Anderson and Roberi J. Monroe. 


190 


| 
i 
2 
4 
| 
4 
q 
= 


THE BIOMETRIC SOCIETY 


ENAR TREASURER’S REPORT FOR 1960 
Marvin A. Kastenbaum, Secretary-Treasurer 


INCOME 


Balance Forward 
Checkbook balance Dec. 31, 1959 
Dues payments 


Share of Proceeds from Washington meeting (1959) 


Credit on Canadian checks 
Payment from WNAR for Programs 
Sustaining member credit 

Payment from AIBS for printing 


EXPENSES 


Checks not honored 
Refund on dues 
Treasurer International 
Transfer to Savings Account 
ENAR expenses 

Postage 

Printing 

Clerical 


Checkbook Balance December 31, 1960 


OPERATING ANALYSIS 


Operating income 
Dues payments 
Other societies 
Bank credit on checks 
Sustaining member credit 


Operating expense 

Surplus for 1960 
CHECKBOOK BALANCE 
Advanced from 1959 
Surplus for 1960 


Checkbook balance November 15, 1960 


SAVINGS ACCOUNT BALANCE 


Advanced from 1959 
Interest 1959 


Proceeds from Washington meeting (1959) 


Total cash in Bank December 31, 1960 


$187.93 
365.78 
30.96 


324.29 
236.73 


16.11 
6.26 
267.88 
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$ 821.40 


584.67 


$ 236.73 


990.25 


$1,551.27 
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561.02 
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W.N.A.R. 
MINUTES OF THE WNAR BUSINESS MEETING 


The annual meeting of the Western North American Region of the Biometric 
Society was held at 1:00 P.M., August 24, 1960, onthe Stanford University Campus. 
William Taylor, President, WNAR, presided. The highlights of the minutes of the 
previous meeting were read by the Secretary, Walter Becker, and were approved. 
The Secretary then gave his report. A method for obtaining News and Announce- 
ment for Biometrics was described. This consisted of sending to everyone who 
changed his address an inquiry as to his position, etc. This system, suggested by 
the ENAR Secretary has produced excellent results. Sometimes ballots are sent to 
student members inadvertently and a reminder was made that they are not entitled 
to vote. Four general mailings to the members of WNAR have been made this year 
The postage cost is about $6 a mailing, but this figure may go up as a result of 
the Secretary’s office being moved to Washington State University, Pullman, 
Washington. 

The Treasurer, Bernice Brown, gave her report. WNAR now has 164 members, 
8 of them students, an overall gain of 14 from last year. As of August 20, 1960, 
there was $445.25 on hand. The operating expenses balanced out the income for 
this year. It was brought out that much of the money in the treasury was accumu- 
lated during past administrations when much of the cost of printing and mailing 
was not born by the Society. WNAR receives $1 for every member, $10 for each 
sustaining member, and nothing from student members. The Treasurer stated 
that perhaps a working capital of $200 would be sufficient in the treasury, declar- 
ing $200 to be surplus. 

The Nominating Committee made its report. Two positions on the Regional 
Committee are now open because of the expiration of the terms of Douglas Chapman 
and Donald Wohlschlag. The names submitted by the Nominating Committee 
were Donald Owen, New Mexico; Lincoln Moses, California; Marion Sandomire, 
California; and Douglas Chapman, Washington. No nominations were made from 
the floor. The vote will be taken by a mail ballot. 

No action was reported by the Relationship of ASA - the Biometric Society 
Committee during the past year by Marion Sandomire. The By-laws of the region 
are being revised by William Taylor and will be submitted soon to the Regional 
Committee. The possibility of having a president-elect was discussed. At present 
the President is elected for two years, in order that he may be more experienced in 
the position. Several alternatives were proposed: 


1. The presidency be made a one-year position. Then we would have a presi- 
dent-elect every year. The president-elect would act as a program chair- 
man and would in addition receive copies of important letters from the 
officers and the Regional Committee. 

2. On alternate years, a president-elect would be elected. 

3. Leave as is. 


It was pointed out that the need for a president-elect might become greater 
as the membership increased. It was suggested that this matter be considered by 
the Regional Committee and be submitted to the membership in a mail ballot. 

Douglas Chapman moved and Calvin Zippin seconded the following motion: 
The By-laws may be amended by a two-thirds vote of members present at any 
annual meeting. Changes in the By-laws may be initiated by the Regional Com- 
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mittee and the By-laws amended by a majority of members voting in a mail ballot. 
This motion was passed by a unanimous vote. 


Suggestions for the use of the surplus fund (about $200) in the treasury were: 


1. Give prizes for the best student paper presented at an annual meeting (it 
was mentioned that this was tried a couple of years ago, but that students 

did not present any papers). 

. Do not use the money. 

. Send bound volumes of Biometrics to foreign universities. 

. Pay students’ dues. 

. Pay the expenses of a speaker at the annual meeting. 

. The funds should be used to encourage interest in the society. 

. They could be used to promote membership (it was pointed out that new 
members’ dues would increase the treasury’s surplus). 

8. Prizes could be given to the member bringing in the most new members. 

9. A year’s subscription could be given to outstanding students. 


No agreement could be reagf#d, so the Regional Committee will debate the 
issue by mail. 

Two possible meeting places for the 1961 Annual Meeting were discussed. 
These were the meeting of the Pacific Division A.A.A.S. at the University of Cali- 
fornia, Davis, June 12-17, 1961 and the IMS Meeting at the University of Washing- 
ton, Seattle, in August (later information indicates that this may be in June). A 
majority voted for the Seattle meeting. 

The purpose of the program at the Annual Meeting was discussed. Should the 
program aim at a specific purpose, or should the papers be an expository review 
aimed at applied statistics? There was very little comment. 

William Taylor announced that he would be attending a meeting in the East 
with representatives of other societies to consider the possibility of a federation of 
statistical societies, besides other purposes. 

The meeting adjourned at 2:10 P.M. 

Those present at the meeting were Abramson, Becker, Bennett, Brown, Chap- 
man, Hopkins, Massey, Mode, Nash, Nicholson, Russell, Sandomire, Taylor, 
Vaughan, and Zippen. 


WH 


NOTE ON ELECTION 


Douglas G. Chapman and Marion M. Sandomire were elected to the Regional 
Committee for 1961-1963. 


CHANGES IN MEMBERSHIP 
(October 1, 1960-January 15, 1961) 
Changes of Address 
Miss Margaret F. Allen, Department of Biometrics, School of Aviation Medicine, 
USAF, Brooks Air Force Base, Texas, U. S. A. 
Dr. David W. Alling, Institute Allergy and Infectious Disease, National Institutes 
of Health, Bethesda, Maryland, U. S. A. 
Professor Maurice S. Bartlett, Department of Statistics, University College, London 
W. C. 1, England. 
Prof. Dr. Eduard Batschelet, 2500 Q Street N.W., Washington, D. C., U.S. A, 
Mr. Rainald K. Bauer, Kolner Str. 228a, Dusseldorf, Germany. 
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Dr. Gilbert W. Beebe, National Academy of Sciences, 2101 Constitution Avenue, 
N.W., Washington 25, D. C., U. S. A. 

Dr. G. E. P. Box, Mathemaics Research Center, U. S. Army, University of Wis- 
consin, Madison 6, Wisconsin, U. S. A. 

Dr. David Bruce, Box 4059, Portland, Oregon, U. S. A. 

Dr. Joseph G. Bryan, 11 Newport Avenue, W. Hartford, Connecticut, U.S. A. 

Mr. Walter E. Cole, Division of Forest Insect Research, U. S. Forest Service, Ogden, 
Utah, U.S. A.’ 

Mr. Jerry Cornfield, National Institutes of Health, Bethesda, Maryland, U. S. A. 

Dr. Paul M. Denson, Deputy Commissioner of Health, 125 Worth Street, New 
York 13, N. Y., U.S. A. 

Prof. Benjamin Epstein, 768 Garland Drive, Palo Alto, California, U. S. A. 

Dr. Robert Flamant, 11, rue Bouille, Fontenay—Aux—Roses (Seine) France. 

Dr. William A. Glenn, 906 King Street, Cary, North Carolina, U. 8S. A. 

Prof. John Gurland, U. S. Army Research Center, University of Wisconsin, Madison, 
Wisconsin, U.S. A. 

Dr. Thomas J. Haley, 3401 Woodcliff Road, Shefthan Oaks, California, U.S. A. 

Dr. Sudako Hayase, 1038 Fifth Street, Santa Monica, California, U.S. A. 

Prof. Dr. Ottokar Heinisch, Grunderstr. 28, Berlin—Grunau, Germany. 

Dr. Henry Hopp, Agricultural Attache—Bogota, c/o U. S. "watt of State, 
Washington 25, D. C., U.S. A. 

Professor Gwilym Jenkinn, Department of Mathematics, Imperial College, London, 
S.W. 7, England. 

Dr. S. Karatas, Department of Animal Husbandry, University of Ataturk, Erzurum, 
Turkey. 

Dr. Eileen B. Karsh, Department of Psychology, University of Pennsylvania, 
Philadelphia, Pennsylvania. 

Mr. Charles Kiss, La Menitre, (Maine-et-Loire) France. 

Mr. Gary F. Krause, 1110 Kentwood Drive, Blacksburg, Virginia, U. 8. A. 

Mr. William T. Lewish, 2109 Wildwood Drive, Woodland Park, Wilmington 5, 
Delaware, U.S. A. 

Mr. Donald G. MacEachern 608 Third Avenue, S. E., Minneapolis 14, Minnesota, 
U.S. A. 

Mr. John W. Mayne, Van Zaeckstraat 49, The Hague, Netherlands. 

Mr. Jean Mothes, 13, boulevard des Invalides, Paris 7e, France. 

Mr. Clifford A. Myers, 2069 N. Navajo, Flagstaff, Arizona, U.S. A. 

Dr. Wesley L. Nicholson, 1215 E. 3rd, Moscow, Idaho, U. 8S. A. 

Dr. Fernando Orozco-Pinan, Explotaction ‘‘El Encin’’ Alcala de Henares (Madrid) 
Spain. 

Dr. Laurence M. Potter, Department of Poultry Husbandry, Virginia Polytechnic 
Institute, Blacksburg, Virginia, U. S. A. 

Dr. D. R. Read, Cadbury Brothers Limited, Bournville, Birmingham, England. 

Mr. A. H. L. Rotti, Av. Emile Verhaeren 70, G-ntbrugge, Belgium. 

Mille. Claude Rouquette, 3, rue Saint-Charles, Paris 15, France. 

M. Jean Soule, 30 avenue Saint-Laurent c. 34 (Orsay) Seine-et-Oise, France. 

Mr. John J. Sowinski, Allstate Insurance Company, 7447 Skokie, Blvd., Skokie, 
Illinois, U. S. A. 

Prof. David F. Votaw, Jr., 6 Fairlane Terrace, Winchester, Massachusetts, U. 8. A. 

Mr. Jerry Warren, Department of Horticulture, University of Nebraska, Lincoln 3, 

Nebraska, U. S. A. 
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Mr. Francis R. Watson, 1415 Winding Lane, Champaign, Illinois, U.S. A. a 


Mrs. Sandra 8. White, 535 E. 72nd Street, New York 21, N. Y., U.S. A. 1. 
Mr. Amador D. Yniguez, Institute of Statistics, P. O. Box 5457, Raleigh, North nies 
Carolina, U. S. A. 
Dr. Sydney S. Y. Young, 68 Barker Road, Strathfield, N.S.W., Australia. e 
New Members 
At Large 
Mr. Pedro Eliz M. Revello, Instituto Venezolano Petroquim, Centro Simon Bolivar, . 
Caracas, Venezuela. 
Brazil 
Mr. Joassy de Paula Neves Jorge, Instituto Agronomico Caixa Postal 28, Campinas = 
S.P., Brazil. He 
| France 
| 
M. Pierre Lossois, 120 bis avenue de Verdun, Issy-les-Moulineaux (Seine) France. 
Germany 
Dipl. G. Enderlein, Karl-Marx-Str. 31, Klein Wenzleben bei, Magdeburg/DDR, 
Germany. 


Mr. Werner Haufe, Sophienstr. 21, Einbeck/Hann., Germany. 

Prof. Dr. A. Maede, Eichenweg 5, Halle/Saale, Germany. 

Mr. Dieter Rasch, Fritz Reite Str. 22, Rostock, Germany. 

Dr. Ernst Weber, Windhalmweg 36, Stuttgart-Plieninger, Germany. 

Doz. Dr. Hermann Witting, Dreikonigstrasse 9, Freiburg i. Br., Germany. 

Italy 

Dott. Giuseppe Agnese, Istituto di Igiene dell’Universita, via Pastore 1, Genova, 
Italy. 

Dr. Giancarlo Chisci, Stazione Sperimentale di Praticoltura, Viale Piacenza 25, Lodi, ] 
Milano, Italy. 

-Dr. Maurizio Turri, Viale O. P. Vigliano 17, Milano, Italy. 


Western North American Region 


Mr. Peter A. Dawson, Department of Genetics, University of California, Berkeley 4, 
California, U. S. A. 

Dr. Olive Jean Dunn, School of Public Health, University of California, Los Angeles 
24, California, U.S. A. 

Prof. Robert F. Fagot, Department of Psychology, University of Oregon, Eugene, 
Oregon, U. S. A. 

Dr. Robert Macey, Department of Physiology, University of California, Berkeley 4, 
California, U. S. A. 

Mr. Prem Singh Puri, 1845 Hearst Avenue, Berkeley 3, California, U. S. A. 

Mr. Kurt Sittman, Animal Husbandry Department, University of California, Davis, 
California, U.S. A. 

Mr. Albert R. Stage, 157 South Howard Street, Spokane 4, Washington, U.S. A. 
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NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary (if 
members at large, to the General Secretary) news of appointments, distinctions, or 
retirements, and announcements of professional interest. 


EDITORIAL NOTE 


The general objective of Biometrics is to promote and to extend the use of 
mathematical and statistical methods in pure and applied biological sciences. This 
objective has been broadly interpreted in editorial decisions and this is proper. 
However, it is believed also that mathematicians and statisticians contributing to 
Biometrics have an obligation to present their work so that the natures of the prob- 
lems in the biological sciences to which the methods apply are clearly understood 
by those who may profit by the use of the methods. Similarly, biologists should 
explain their problems so that their needs and methods are clearly understood by 
the mathematician and statistician. While these requirements may not be easy to 
méet, direct consideration of them should greatly increase the value of Biometrics 
to all members of the Biometric Society. Careful attention to introductions of 
papers, including formulation of problems and models, with skillful selection of 
examples would be helpful. When possible, algebraic and mathematical discussions 
should be placed in appendices to papers in order that they do not detract from the 
emphasis on the problem areas and the readability of papers. 


- MEETINGS OF E. N. A. R. 


The Eastern North American Region will meet jointly with the Institute of 
Mathematical Statistics and the American Statistical Association on April 20, 21, 
and 22, 1961, at Cornell University. Titles and abstracts, the latter in duplicate in 
the form published in Biometrics, of contributed papers for ENAR should be sent 
to Dr. Erwin L. LeClerg,. Biometrical Services, Plant Industry Station, Belts- 
ville, Maryland. 

In 1961, ENAR will also meet jointly with the American Institute for Biological 
Sciences at Purdue University and with the American Statistical Association in 
New York City. 


MEETING OF W.N.A:R. 
The Western North American Region will meet jointly with the Institute of 
Mathematical Statistics in June, 1961, at the University of Washington. 
UNIVERSITY ANNOUNCEMENTS 
THE CATHOLIC UNIVERSITY OF AMERICA 


The Statistical Laboratory of the Catholic University of America has been 
awarded a grant by the National Institutes of Health for training in the field of 
Biometry. 

The stipends for first-year graduate students are $2250 plus tuition; family 
allowances for dependents and annual increases are provided. 
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The students will pursue the same general program as other students in mathe- 
matical Statistics. They will participate in the consulting activities of the laboratory 
and will be required to attend some courses in the biological sciences or other fields 
relevant to the study of biometry. 

In addition to the grants in biometry, there are also fellowships under the 
National Defense Education Act available. Some appointments to graduate as- 
sistantships and research assistantships will also be made. 

The Statistical Laboratory of the Catholic University of America is also expand- 
ing its activities into the areas of biomathematics and biometry. A training program 
and consulting service are being organized. Professor Edward Batschelet (on leave 
from the University of Basel, Switzerland) was appointed Visiting Professor in 
this program for the academic year 1960-61. Professor Harold Bergstrom of the 
Institute of Applied Mathematics of Chalmers Institutes of Technology (Goteborg, 
Sweden) was appointed Visiting Professor for the academic year 1960-61. He will 
primarily be engaged in research in probability theory. Professor D. Dugue of the 
Sorbonne (Paris, France) and Dozent T. E. Dalenius of Stockholm University are 
expected to visit Catholic University during the spring term 1961. 

Requests for further information and application forms should be addressed 
to Professor Eugene Lukacs, Director, Statistical Laboratory, The Catholic Uni- 
versity of America, Washington 17, D. C. 


IOWA STATE UNIVERSITY 


The National Science Foundation will sponsor a Summer Institute for College 
Teachers of Statistics at Iowa State University for the 1l-week period from June 5 
through August 18, 1961. The Departments of Statistics of three other universities, 
Kansas State, Utah State and the University of Wyoming, are cooperating with 
Iowa State’s statistical center in presenting this institute. 

Financial support in the form of stipends, dependency allowances and travel 
allowances will be awarded to 50 eligible applicants. All American college and 
university teachers who are, or who during the 1961-62 academic year will be, re- 
quired to teach one or more courses in statistics as part of their regular assignments 
are eligible for consideration. 

The institute is planned to provide additional basic training in statistics for 
present and prospective teachers who, though well-grounded in other fields, have 
limited backgrounds in statistics. Also it will provide more advanced courses and 
seminars designed to keep college and university teachers abreast of new develop- 
ments. 

Courses are scheduled in Statistical Methods, Theory of Statistics, Experi- 
mental Design, Survey Designs, Topics in Foundations of Probability and Statistics, 
and Intermediate Applied Decision Theory. In addition, an opportunity will be 
provided for those interested to observe a demonstration class in Principles of Sta- 
tistics at the undergraduste level. The faculty will include the institute director, 
Dr. T. A. Bancroft, Director of the Iowa State University Statistical Laboratory 
and Head, Department of Statistics; Dr. R. J. Buehler, Associate Professor of 
Statistics, Iowa State University; Dr. H. T. David, Associate Professor of Statistics, 
Iowa State University; Dr. H. C. Fryer, Head of the Department of Statistics and 
Statistical Laboratory Director, Kansas State University; Dr. H. O. Hartley, Pro- 
fessor of Statistics, Iowa State University; the Institute Associate Director, Dr. 
D. V. Huntsberger, Associate Professor of Statistics, Iowa State University; and 
Dr. R. L. Hurst, Head of the Department of Applied Statistics and Statistical 
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Laboratory Director, Utah State University. Guest lecturers will present a series 
of special seminars. 

Requests for information or application forms should be addressed to: The 
Director, Summer Institute in Statistics, 102 Service Building, Iowa State Uni- 
versity, Ames, Iowa. 

The Department of Statistics at Iowa State University will offer also eight 
applied courses in statistical theory and methods in its two 1961 summer sessions. 
These courses are planned primarily for graduate students or research workers with 
limited mathematical backgrounds who wish to use statistical techniques intelli- 
gently for application to other fields. In addition, a course on special topics in 
theoretical or applied statistics may be studied at the graduate level. Senior staff 
members will be available during most of the summer for consultations on research 
or special problems. 

Students may register for either or both of the six-week summer sessions: June 
5-July 12 and July 12-August 18. The complete list of statistics offerings for the 
first session is as follows: Stat. 401, “Statistical Methods for Research Workers 
“(at the level of Snedecor’s Statistical Methods); Stat. 447, ‘Statistical Theory for 
Research Workers’”’ (mainly theory of experimental statistics at the level of Ander- 
son and Bancroft’s ‘Statistical Theory in Research’’; Stat. 411, ‘‘“Experimetnal De- 
signs for Research Workers,”’ Stat. 599, “Special Topics;”’, Stat. 599A1, ‘Topics in 
Foundations of Probability and Statistics; and Stat. 699, ‘Research.’ In the 
second session will be offered Stat. 402, a continuation of 401, Stat. 448, a continu- 
ation of 447; Stat. 421, “Survey Designs for Research Workers;” Stat. 599, Stat. 
599A2, “Intermediate Applied Decision Theory (at the level of Blackwell and 
Girshick, Theory of Games and Statistical Decisions), and Stat. 699. 


JOHNS HOPKINS UNIVERSITY 


Beginning next September, the Department of Biostatistics at The Johns 
Hopkins University will offer an expanded program of study and research leading 
to Master of Science and Doctor of Science degrees. The curriculum has been modi- 
fied by increasing the scope of the basic courses in statistical theory and statistical 
methods and by the addition of specialized courses including least squares and 
regression, stochastic processes, nonparametric methods, sampling and survey 
methods, biological assay, design of experiments and digital computer. programming. 
These changes reflect a realization of the need for more intensively trained statis- 
ticians in the areas of biology, medicine and public health, and they are the direct 
result of the increasingly important role played by mathematics and statistics in 
all areas of scientific research. 

The new program was made possible in part by the recent addition to the de- 
partment of Drs. Allyn W. Kimball and David B. Duncan who together with Drs. 
Helen Abbey, Earl Diamond, John J. Gart and Margaret Martin form the perma- 
nent staff. In September, 1961, Dr. Norman T. J. Bailey of Oxford University will 
join the staff as visiting professor and will teach the course in stochastic processes. 
Additional appointments may be announced shortly. 

The department has a limited number of liberal fellowships available, and 
interested students are invited to write to the Chairman, Department of Biosta- 
tistics, 615 North Wolfe Street, Baltimore 5, Md. for further information. 


PURDUE UNIVERSITY 
There will be three intensive courses in the general areas of statistical methods 
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and quality control, design of experiments, and operations research at Purdue Uni- 
versity this summer. 

The course in Statistical Methods and Advanced Quality Control has been 
given at Purdue annually since 1947, being most recently revised in 1960. This 
course is designed for those who have had the equivalent of one of the intensive 
courses in statistical quality control given during and after the war, and who want 
to learn more about the statistical approach to industrial and research problems. 

Topics to be studied during the ten-day quality control course are Significance 
Tests and Confidence Intervals, Significance of Differences, Linear Correlation and 
Regression, Single Sampling for Measurements, Sequential Sampling for Measure- 
ments, Multiple Correlation, and Analysis of Variance. Instructors include Profs. 
Irving W. Burr, the course director, and Charles R. Hicks, both of Purdue, Cecil 
C. Craig of the University of Michigan and Gayle McElrath of the University of 
Minnesota. The dates for this course are September 5-15. 

The second course is on Design of Experiments and is for statisticians, quality 
control personnel, engineers, and others concerned with planning, analyzing, and 
interpreting the results of industrial experiments. The dates for this course are 
June 7-17. This will be the third year this course has been offered. 

Professor Hicks, of the mathematical and statistical staff, the course director, 
emphasizes that this advanced course is for persons who have had previous statisti- 
cal training, including work on tests of hypotheses, linear correlation, and at least an 
introduction to analysis of variance. 

Topics included in the short course are Review of Analysis of Variance, Princi- 
ples of Experimental Design, Variance Component Analysis, Randomized Blocks 
and Latin Squares, Factorial Experiments, Split Plot Designs, Confounding in 
Factorial Experiments, Incomplete Block Designs, Fractional Replications, and 
Introduction to Evolutionary Operations (EVOP). 

In addition to Professor Hicks, instructors will include Prof. Clyde Y. Kramer, 
Virginia Polytechnic Institute, and Professor McKirath. Enrollment will be limited 
to a maximum of 25-30. 

The third ten-day short course is on The Mathematical Techniques of Oper 
ations Research which will be offered at Purdue University for the second time this 
summer on June 5-15. 

The course has been designed for statisticians, quality control analysis, engi- 
neers, and other technical personnel in industrial and management positions. Empha- 
sis will be placed on the mathematical techniques of operations research and the 
application of these methods to current industrial and military problems. 

These methods involve the construction of mathematical models representing 
the operation of industrial management or a military organization, and suggest 
the best solutions to problems involved. Among the topics to be discussed during 
the course are inventory control models, waiting line models, linear programming, 
simplex method, transportation methods, production scheduling models, search 
theory, cost-effectiveness studies, and systems analysis. 

Instructors for the short course are Prof. Paul Randolph of Purdue, who is the 
short course director, Albert Madansky, of the RAND Corporation, and Prof. 
Bernard Lindgren, of the University of Minnesota. Enrollment in this course will 
be limited. 

All three courses are sponsored jointly by the Statistical Laboratory and the 
Division of Adult Education at Purdue. Further intormation about any of the 
courses may be obtained by writing to the Division of Adult Education, Purdue 
University, Lafayette, Indiana. 
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SOUTHERN METHODIST UNIVERSITY 


The National Science Foundation has awarded Southern Methodist University 
a grant of $26,400 to conduct a basic study of the feasibility of making the new 
Science Information Center at SMU a regional scientific and technical information 
center serving the needs of Southwestern industry and higher education. 

Dr. William J. Graff, Jr., chairman of the SMU Department of Mechanical 
Engineering, is principal investigator for the study. He is assisted by Sam G. 
Whitten, SMU Science Librarian. 

The building to house the new Science Information Center, a gift of local 
industrialists, is under construction and is scheduled for completion next August. 
A million dollar structure consisting of 80,000 sq. ft. on four floors, it will have 
space for half a million books. It will serve the SMU faculty and student body as 
well as the new Graduate Research Center. Space is provided in the building for 
administrative offices of the Graduate Research Center, whose director is Dr. Lloyd 
Berkner; for the SMU Map Library and Herbarium; and for the DeGolyer Geology 
Collection, in addition to the general document collections in science, mathematics, 
and engineering. 

The study sponsored by the National Science Foundation is scheduled for com- 
pletion in May, 1961. It will be based to a large extent on personal interviews and 
questionnaires aimed at the scientists and technicians who are engaged in research 
in the Southwest. An attempt will be made to find out what kind of information the 
research people need and in what subjects. 

Also involved in the study are visits to and detailed studies of existing scientific 
information centers and an economic study of the area done by business economists. 

The investigators expect to reach many research workers by questionnaires 
sent through local professional and scientific and technical societies, and will ask 
for employers aid in setting up personal interviews with research workers. 


UNIVERSITY OF MINNESOTA 


A special summer program of statistics in the Health Sciences will be held 
from June 13 to July 28, 1961 at the University of Minnesota. Courses in diverse 
areas of statistics at elementary and advanced levels will be given. Experts from 
around the nation will be teaching. Stipends are available to qualified students. 
For further information write to Biostatistics, 1226 Mayo, University of Minnesota, 
Minneapolis 14, Minnesota. 


VIRGINIA POLYTECHNIC INSTITUTE 


The 1961 session of the Southern Regional Graduate Summer Session in Sta- 
tistics will be held at the Virginia Polytechnic Institute, Blacksburg, Virginia, from 
June 15 to July 22, 1961. 

The Virginia Polytechnic Institute, Oklahoma State University, North Caro- 
lina State College, and the University of Florida have agreed to operate a continuing 
program of graduate summer sessions in statistics to be held at each institution in 
rotation. The program was instituted at Virginia Polytechnic Institute in the Sum- 
mer of 1954. 

The 1961 session, like previous sessions under this program, is intended to 
serve: 1) teachers of statistics and mathematics; 2) professional workers who want 
formal training in modern statistics; 3) research and engineering personnel who 
want intensive instruction in basic statistical concepts and modern statistical meth- 
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odology; 4) Public Health statisticians who wish to keep informed about advanced 
specialized theory and methods; 5) prospective candidates for graduate degrees in 
statistics; and 6) graduate students in other fields who desire supporting work in 
statistics. 

The session will last six weeks and courses will carry five quarter hours of credit. 
Not more than two courses may be taken for credit at any one session. The summer 
work in statistics may be applied as residence credit at any of the cooperating insti- 
tutions, as well as certain other universities, in partial fulfillment of the require- 
ments for a graduate degree. The program may be entered at any session, and 
consecutive courses will follow in successive summers so that it would be possible 
for a student to complete the course work in statistics for a Master’s degree in three 
summers. Students must satisfy the remaining requirements for course work 
and thesis at the institution where they are to be admitted to candidacy. The 
advanced courses may be accepted as part of the Ph.D. program of the participating 
institutions. 

A limited number of fellowships will be available for applicants from certain 
specialized areas. Doctoral courtesy will be honored for those holding Ph.D. or 
M.D. degrees. 

The courses to be offered in statistics in 1961 at the Virginia Polytechnic Insti- 
tute are as follows: Statistical Methods, Sampling Theory; Applied Statistics for 
Engineers and Physical Sciences; Theory I, Probability; Theory II, Statistical 
Inference; Theory III, Theory of Least Squares; Principles and Practices of High 
Speed Computing (enrollment limited to 14); Non-parametric Methods; and Multi- 
variate Methods. 

A number of courses in advanced mathematics will be available during the 
Summer Session. For a complete listing please consult the University timetable. 
A series of Colloquia involving recent developments in statistical theory and meth- 
odg will be conducted during the special Summer Session. 

Requests for application blanks for the summer session and for fellowships 
should be addressed to Dr. Boyd Harshbarger, Head, Department of Statistics, 
Virginia Polytechnic Institute, Blacksburg, Virginia. 


CONTINENTAL CLASSROOM 


Probability and Statistics, a nationally televised college-credit mathematics 
course, will be offered on the second semester of Continental Classroom beginning 
January 30, 1961. 

Presented by Learning Resources Institute in cooperation with the Conference 
Board of the Mathematical Sciences, and telecast in color and black-and-white 
by the National Broadcasting Company, the course will be taught by Professor 
Frederick Mosteller, Chairman, Department of Statistics, Harvard University, and 
Professor Paul C. Clifford, Professor of Mathematies, Montclair (N. J.) State College. 

More than 300 colleges and universities are expected to offer the televised 
course for college credit. It will be carried by 170 stations throughout the nation, 
from 6:30 to 7 a.m. Monday through Friday in each time zone. 


NEW JOURNALS 
Journal of the Forensic Science Society j 


Stuart S. Kind is the editor of a new journal, The Journal of the Forensic Science 
Society with editorial office c/o Rossett Holt, Pennal Ash Road, Harrogate, York- 
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shire, England. Papers on biometry, especially in relation to human individuality, 
classification, and identification systems, will be welcomed. 


Crop Science 


Crop Science is the name of the new research journal which will appear in 
February 1961 as the official publication of the Crop Science Society of America, 
an affiliate of the American Society of Agronomy. 

This new, bimonthly journal will carry research reports on breeding, genetics, 
physiology, ecology, and management of field crops, pastures, ranges, and turf- 
grasses from crop scientists in the U. S., Canada, and other countries. 

Crop Science will be a companion publication to Agronomy Journal, the official 
organ of the ASA. It will alternate in publication dates with the Journal and will 
carry the articles (formerly publishable in the Journal) of direct interest to workers 
in the above-mentioned areas of research. Agronomy Journal will continue to carry 
the articles of wider agronomic scope—of interest to both crop and soil scientists, 
seed and weed technologists, plant pathologists, agricultural meteorologists, and 
others. 

Publication of research reports in Crop Science will be open to members of the 
Crop Science Society—with joint membership in the American Society of Agronomy. 
Detailed information on publication in gr subscription to Crop Science or Agronomy 
Journal or both may be obtained from the American Society of Agronomy, 2702 
Monroe Street, Madison 5, Wisconsin. 


TRAVEL GRANTS FOR ATTENDANCE AT THE INTERNATIONAL 
CONGRESS OF MATHEMATICIANS 


Travel grants will be made to a number of mathematicians who wish to attend 
the International Congress of Mathematicians in Stockholm, on August 15-22, 
1962. It is hoped that funds available through various sources may provide travel 
assistance for a considerable number of mathematicians. 

There will be a greater effort than in the past to give aid to younger people. 
As grants will be made only to those who have filed spplications, it is urgent that 
any who wish to receive a grant should fill out and file an application. Younger 
people are urged to file applications so that their cases can be considered. Appli- 
cations can be obtained from the Division of Mathematics, National Academy of 
Sciences, National Research Council, Washington 25, D. C. by requesting an appli- 
cation for a travel grant to the 1962 International Congress. 

The deadline for filing of applications is November 1, 1961, and an attempt 
will be made to announce the grants by January 1, 1962. 

Awarding of grants will be made only to those persons whose applications have 
been received, in good order, by November 1. The selection will be made by a com- 
mittee consisting of the regular Committee on Travel Grants of the Division of 
Mathematics of the National Academy of Sciences—National Research Council 
enlarged to include representatives of societies affiliated with the Division and 
representatives of various governmental agencies. 


J NEWS ABOUT MEMBERS 
Laurence H. Baker is presently employed by Hy-Line Poultry Farms as a geneti- 


cist in Des_Moines, Towa. 
David Bruce is Chief of Division Forest Fire Research, Pacific Northwest 
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Forest and Range Experiment Station, USDA. He formerly held the same position 
at the Southern Forest Experiment Station. 

Martha W. Dicks is presently Assistant Professor of Human Nutrition at 
Montana State College. 

Charles W. Dunnett was awarded a D.Sc. degree in statistics on completion of 
a two-year period of research at the University of Aberdeen, Scotland, under Dr. 
D. J. Finney, F.R.S. The title of his thesis was ‘“The Statistical Theory of Drug 
Screening.” He has now returned to his position as Head of the Statistical Design 
and Analysis Department at the Lederle Laboratories Division of the American 
Cyanamid Company, Pearl River, New York. 

Lincoln J. Gerende is presently Instructor in Biostatistics, Department of 
Epidemiology and Public Health, Medical School, Yale University. 

Leo A. Goodman is a Visiting Professor of Mathematical Statistics and Soci- 
ology at Columbia University during 1960-61 on leave from his position at the 
University of Chicago. 

John Gurland, on leave from Iowa State University, is presently employed at 
the Mathematics Research Center of the University of Wisconsin in Madison, Wis- 
consin. 

James A. Hagans of the Department of Preventive Medicine and Public Health 
of the University of Oklahoma has recently received his Ph.D. degree from that 
institution. 

Carl E. Hopkins is now special consultant (Air Pollution Medical Studies) to 
the School of Public Health, UCLA. 

Leo Katz has returned to his position as Head of the Department of Statistics, 
Michigan State University. He was on leave of absence serving as Scientific Liaison 
Officer for the Office of Navy Research in London, England, covering statistics and 
probability in Europe. 

George H. Kennedy is presently employed by the Department of Health, 
Education and Welfare, U. S. Public Health Service in Bethesda, Maryland. He 
was formerly at the Biological Laboratories of the Department of Defense at Fort 
Detrick, Maryland. 

John R. Kinzer has recently returned to his position of Professor of Psychology 
at Ohio State University, Columbia. He had been on a two-year leave of absence 
to work at the System Development Corporation, Santa Monica. 

Gary F. Krause has given up his post as Instructor in Statistics at Kansas 
State University to become a full time graduate student in Statistics at the Virginia 
Polytechnic Institute at Blacksburg, Virginia. 

Dr. Kuo Hua Lee has taken a position as an Associate Professor of Biostatistics 
in the University of Oregon Dental School, Portland. 

William T. Lewish has taken a position as Special Service Engineer for the E. I. 
DuPont de Nemours and Co. in Wilmington, Delaware. 

Alexander M. Mood formerly president of General Analysis Corporation be- 
came Vice-President of C-E-I-R, Inc. and Manager of the Los Angeles Research 
Center of C-E-I-R, when General Analysis Corporation merged with C-E-I-R. 

Wesley L. Nicholson is on a four-month-special assignment as Professor of 
Mathematics at the University of Idaho. His permanent post is senior statistician 
for the General Electric Company in Richland, Washington. 

L. M. Potter, formerly Research Associate in the Department of Poultry 
Science at the University of Connecticut, is now Associate Professor in the Depart- 
ment of Poultry Husbandry at the Virginia Polytechnic Institute at Blacksburg, 
Virgini 
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Mr. J. N. K. Rao, an advanced graduate student in Statistics at Iowa State 
University, has been selected as the recipient of the George W. Snedecor Award in 
Statistics, which consists of a cash prize of $25.00, a year’s membership in the Insti- 
tute of Mathematical Statistics and a year’s subscription to the Annals of Mathe- 

Maynard W. Shelly, II, formerly a psychologist in the Office of Naval Research 
is now a mathematician in the Logistics and Mathematical Statistics Branch of 
ONR. 

John H. Smith is a Visiting Professor at the University of Chicago during the 
year. He is on sabbatical leave from the American University. 

Jerry Warren is presently Assistant Professor in the Horticulture Department 
of the University of Nebraska. He was formerly with the Vegetable Crops Depart- 
ment of Cornell University. 
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TABLE OF CONTENTS _ 
A Reproducible Method of Counting Persons of Spanish 


Multiple Comparisons Among Means...............++-- Ouive JEAN DUNN 
A Parametric Estimate of the Standard Error of the 

Probability Table for Number of Runs of Signs of First 

Differences in Ordered Evucene S. Epginaton 


Residual Analysis 
J. Freunp, RicHarp W. Jr. ano C. W. CLuntes-Ross 


Note on Stepwise Least Squares..ArTHUR S. GoLpBERGER AND Dirk JocHEMS 


Some Nonparametric Tests for Comovements Between 
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Almost Linearly-Optimum Combination of Unbiased 
Cooperation Among Statistical and Other Societies. .... Morris H. Hansen 
A Note on Follow-Up for Survival in the eK of 
. J. THOMPSON AND D. Kopin 
On Some Measures of Food Marketing Georce W. Lapp 


On Stabilizing the Binomial and Negative Binomial 


Unbiased Ratio Estimators in Stratified Sampling. ...Jose Nieto pe PascaaL 
The Probability of Reversal Associated with a Test. Procedure when Data 


Are Incomplete............. BerNnarp S. PASTERNACK AND JUNJIRO OGAWA 
Some Aspects of Seasonality in the Consumer Price Index...... H. E. Ritey 
Variance Estimates in “Optimum” Sample Designs............. ALAN Ross 
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A Journal of Statistics for the 
Physical, Chemical and Engineering Sciences 


Vol. 2, No. 4 November, 1960 
CONTENTS 

Statistical Life Test Acceptance Procedures............ BENJAMIN EPSTEIN 

Estimation from Life Test BENJAMIN EpsTEIN 


Some New Three Level Designs for the em. of Quantitative Variables 
G. E. P. Box anp D. W. BEHNKEN 
Graphical Procedure for Fitting the Best Line to a Set of Points 
L. Dotsy 
Tables of Tolerance-Limit Factors for Normal Distributions 
ALFRED WEISSBERG AND GLENN H. Beatty 
On the Evaluation of the Negative Binomial Distribution with Examples 


. P. Patm 

On Methods of open Sets of hag 4 Orthogonal Latin Squares 

Using a Computer...... R. C. Bose, I. M. Coakravarti AND D. E. KNuTH 

Notices 

Vol. 3, No. 1 February, 1961 

CONTENTS 


Average Run Lengths in Cumulative os Quality Control Schemes 
P. L. GotpsmitTH AND H. WHITFIELD 
Prediction Regions for Several Predictions from a - Regression Line 
GeraLp J. LigBERMAN 
The Robustness of Certain Life Testing Procedures Derived from the 
Exponential Distribution...... Marvin ZELEN AND Mary C. DANNEMILLER 
An Application of a Balanced Incomplete Block Design. . Peter W. M. JoHN 
Multi-component Systems and Structures and Their Reliability 
Z. W. BrrnsauM, J. D. Esary ann S. C. SAUNDERS 
An Asymptotic Distribution for an Occupancy Problem with Statistical 
M. Havrerin anp G. L. Burrows 
Outliers in Patterned Experiments: A Rae Appraisal 
Irwin D. J. Bross 


AcHueson J. DuNcAN AND WILLIAM BERANEK 
Statistical ieee for High Speed Computers 
Notices 


Technometrics is published quarterly in February, May, August, and 
November. The annual non-member subscription rate is $8.00. To members 
of the American Statistical Association and the American Society for Quality 
Control the rate is $6.00. Checks should be made payable to Technometrics 
and addressed to ee Post Office Box 587, Benjamin Franklin 
Station, Washington 6, D 
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INTERNATIONAL JOURNAL OF ABSTRACTS 
STATISTICAL THEORY AND METHOD 


A Journal of the International Statistical Institute 


The aim of this journal of abstracts is to give complete coverage 
of published papers in the field of statistical theory (including 
associated aspects of probability and other mathematical methods) 
and new published contributions to statistical method. 


All contributions in the following five journals—being wholly 
devoted to this field—are abstracted: Annals of Mathematical 
Statistics; Biometrika; Journal, Royal Statistical Society (Series 
B); Bulletin of Mathematical Statistics; Annals, Institute of Sta- 
tistical Mathematics; and a further group of six journals are ab- 
stracted on a virtually complete basis as follows: Biometrics; 
Metrika; Metron; Review, International Statistical Institute; 
Technometrics; Sankhyé. There are about 250 other journals 
partly devoted to statistical theory and method from which the 
appropriate papers are abstracted. 


The abstracts are about 400 words long—the recommendation 
of UNESCO for the “long” abstract service: they are in the Eng- 
lish language although the original language of the paper is 
noted on the abstract together with the name of abstractor. In 
addition to the address of the author(s) are given in detail to 
facilitate contact in order to obtain further detail or request an 
off-print. The journal is published quarterly and contains ap- 
proximately 1000 abstracts per year. 


A scheme of classification has been developed for the abstracts 
that is flexible and facilitates the transfer of code numbers to 
punched cards. A unique aspect of this journal is that the pages 
are colour-tinted according to the main sections of classification. 
This method of colour-coding the pages provides a distinctive 
and powerful visual aid in the identification of abstracts in what- 
ever manner the journal is filed for reference. 


Annual Subscription £5 (U.S.A. and Canada $16.00) 
Single Number 30s (U.S.A. and Canada $4.50) 


OLIVER AND BOYD LTD. 
Tweeddale Court, 14 High Street, Edinburg, 1 
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INFORMATION FOR CONTRIBUTORS 


MANUSCRIPTS 


Contributions for Biometrics may be addressed to Dr. Ralph A. Bradley, Depart- 
ment of Statistics, The Florida State University, Tallahassee, Florida, U.S.A.; 
authors residing in the following Society Regions can expedite consideration of papers . 
by submitting them to the appropriate Associate Editor, namely; BRITISH RE- 
GION: Dr. 8. C. Pearce, East Malling Research Station, East Malling, Maidstone, 
Kent, England; AUSTRALASIAN REGION: Dr. E. A. Cornish, University of 
Adelaide, Adelaide, Australia; FRENCH REGION: Dr. Georges Teissier, Faculté 
des Sciences de Paris, 1 rue V. Cousin, Paris, France. QUERIES, NOTES, and 
related correspondence should be directed to Dr. D. J. Finney, Department of 
Statistics, University of Aberdeen, Meston Walk, Old Aberdeen, Scotland. Books 
and materia: for Book Reviews should be sent to Mr. J. G. Skellam, The Nature 
Conservancy, 19 Belgrave Square, London, 8.W. 1, England. 

MANUSCRIPTS must be submitted in triplicate, with typescript doublespaced 
throughout. Marginal notes may obviate typographical difficulties presented by 
complicated formulae or tables—authors should not attempt editorial instructions 
or markings for the printer. TABLES should be identified by arabic number and 
by a short descriptive title. ILLUSTRATIONS should also be identified by arabic 
number and by a brief caption. (Captions should not be included in illustrations, 
but should be typewritten collectively on an accompanying sheet.) Originals 
should be approximately 8.5 x 11 in. (21.5 x 28 cm.). The original of each chart, 
diagram, or graph should be executed in black on white drawing paper or board, on 
blue tracing linen, or on coordinate paper ruled in blue only; coordinate lines to be 
reproduced should be ruled in black. For printing, illustrations may be reduced to 
¥ or ¥ original dimensions. Lines should therefore be of sufficient thickness, and 
decimal points, periods, and stippled dots should be solid black circles large enough 
to reproduce well. Lettering and numerals should be at least 1 mm. high when 
reproduced in a cut 3 in. (7.5 cm.) wide. Photographs should be prints on glossy 
paper with strong contrasts, and if grouped in a plate should be mounted ,géntig- 
uously. All tables and illustrations should be mentioned explicitly inthe text. 
REFERENCES (BIBLIOGRAPHIC) should be collectively listed slphabyptally 
by author; textual citation by author and year is preferred. 


ABSTRACTS é 
Abstracts of papers presented at meetings of the Biometric Society or of its 
regions are printed in Biometrics following such meetings. They should be submitted 
to the person designated to receive them for a particular meeting in exactly the form 
published in Biometrics (except for an Abstract Number), doublespaced on bond 
paper, and in duplicate. Use of formulae requiring display printing is to be avoided. 


Notices, ANNOUNCEMENTS, AND Biometric Society Reports 


International and regional reports and notices should be submitted by the 
appropriate officers of the Society and its Regions in duplicate doublespaced on 
separate sheets exactly as they are to be printed in Biometrics. Other material to 
be printed in News and Announcements should also be submitted doublespaced 
and in duplicate. 


SustTaininc MEMBERS OF THE Biometric Society 
Abbott Laboratories 
American Cancer Society, Inc. 
General Foods Corporation, Research Center 
Heisdorf and Nelson Farms, Inc. 
Merck, Sharp and Dohme Research Laboratories 
Schering Corporation 
Smith, Kline and French Laboratories 
E. R. Squibb and Sons 
The Upjohn Company 
Wallace Laboratories, Division of Carter Products 
Wyeth Institute of Applied Biochemistry 
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BACK ISSUES 


Back issues of Biometrics are available at the following postage-paid 
prices in U.S.A. currency: 


Price per Price per 
Year Volume Number Single Number Volume (unbound) 


1945 1 1 to6 $1.00 $6.00 
1946 2 1to6 1.00 6.00 
1947 3 1to4 1.50 5.00 
1948 4 lto4 1.50 5.00 
1949 5 1lto4 1.50 5.00 
1950 6 1to4 1.50 5.00 
1951 7 1to4 2.00 8.00 
1952 8 1lto4 2.00 8.00 
1953 9 lto4 2.00 8.00 
1954 10 1lto4 2.00 8.00 
1955 11 lto4 2.00 8.00 
1956 12 lto4 2.00 8.00 
1957 13 1to4 2.00 8.00 
1958 14 1lto4 2.00 8.00 
1959 15 lto4 2.00 8.00 
1960 16 lto4 2.00 8.00 


Reprints of individual articles are not available except to authors at the 
time of printing. Three special issues are among the numbers listed 
above. They are: 


1947 Volume 3 Number 1 The Analysis of Variance 
1951 Volume 7 Number 1 Components of Variance 
1957 Volume 13 Number 3 The Analysis of Covariance 


Also available are: 
Fishery Reprint Series (Selected reprints from Vol. 5) $1.00 
Subject Index (Volumes 1-10) 1.00 
Proceedings, International Biometric Symposium, 
Campinas, Brazil, 1955. 1.00 


Inquiries, non-member subscriptions, and orders for back issues and 
other material listed above should be addressed to: Biomerrics, DEPART- 
MENT oF Sratistics, THe Fiorma Strate UNIversITY, TALLAHASSEE, 
Froripa, U.S.A. 
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